Figure 1: Information Panel for Term
Figure 2: Tree Control for Term
Figure 3: Information Panel for Widgets and Windows
[X] Term (Y)
X represents the number of documents containing the term.
Term represents the term.
Y represents the overall frequency of the term.
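The two numbers in a label can be illustrated with a minimal sketch. It assumes documents are plain strings tokenized on whitespace and lowercased; the function name term_labels and the sample documents are hypothetical and not taken from the tool itself.

```python
from collections import Counter

def term_labels(documents):
    """Build "[X] term (Y)" labels from a {doc_id: text} mapping.

    X = number of documents containing the term,
    Y = total occurrences of the term across all documents.
    Assumption: whitespace tokenization and lowercasing (the tool may differ).
    """
    doc_counts = Counter()    # X: in how many documents the term appears
    total_counts = Counter()  # Y: overall frequency of the term
    for text in documents.values():
        tokens = text.lower().split()
        total_counts.update(tokens)
        doc_counts.update(set(tokens))  # count each document at most once
    return {t: f"[{doc_counts[t]}] {t} ({total_counts[t]})" for t in total_counts}

# Hypothetical sample documents
docs = {"doc1": "valve leak valve", "doc2": "pump leak"}
labels = term_labels(docs)
print(labels["valve"])  # [1] valve (2)
print(labels["leak"])   # [2] leak (2)
```

Here "valve" occurs twice but only in one document, while "leak" occurs once in each of two documents.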
Clicking the "Definition" button opens a Google web search for the term.
Note: The current panel is informational only. Its design hints at features planned for future work.
The list of terms can be sorted and filtered to make it easier to explore.
Terms can be ordered either alphabetically or according to term frequencies. Moreover, the sorting can be in either ascending or descending order.
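The four combinations of sort key and direction can be sketched as follows. This is only an illustration under the assumption that the term list is a mapping from term to frequency; the function name sort_terms and its parameters are hypothetical.

```python
def sort_terms(frequencies, by="alphabetical", descending=False):
    """Return (term, frequency) pairs sorted by name or by frequency.

    Assumption: `frequencies` maps each term to its overall frequency.
    """
    if by == "alphabetical":
        key = lambda item: item[0].lower()  # case-insensitive term order
    elif by == "frequency":
        key = lambda item: item[1]
    else:
        raise ValueError(f"unknown sort key: {by!r}")
    return sorted(frequencies.items(), key=key, reverse=descending)

# Hypothetical term frequencies
freqs = {"valve": 7, "leak": 12, "pump": 3}
print(sort_terms(freqs))
# [('leak', 12), ('pump', 3), ('valve', 7)]
print(sort_terms(freqs, by="frequency", descending=True))
# [('leak', 12), ('valve', 7), ('pump', 3)]
```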
Terms can be omitted from the displayed list according to:
Numbers
Stopwords
Words that are too common to be useful for automated textual analysis.
Standard stopwords, e.g. "and" and "the".
DE stopwords, terms identified for the task/domain.
Misspelled & Spelled Correctly
These are determined using a built-in spell checker.
Note: A correctly spelled term may be identified as misspelled if it is not included in the spell checker.
Frequencies less than & greater than X
The value of X can be modified. This allows the list to be pared down based on term frequencies. For example, omitting lower-frequency terms helps surface the more popular terms. Alternatively, omitting higher-frequency terms helps identify unique terms.
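The omission options above can be sketched as a single filter. This is a rough illustration, not the tool's implementation: the option names, the tiny stopword set, and the use of a plain word set in place of the built-in spell checker are all assumptions.

```python
# Hypothetical stand-in for the standard stopword list
STANDARD_STOPWORDS = frozenset({"and", "the", "of", "a"})

def keep_term(term, freq, *, omit_numbers=False, stopwords=frozenset(),
              known_words=None, omit_misspelled=False,
              omit_spelled_correctly=False, omit_below=None, omit_above=None):
    """Return True if the term survives every active omission option."""
    if omit_numbers and term.isdigit():  # only purely numeric tokens
        return False
    if term.lower() in stopwords:
        return False
    if known_words is not None:
        # Assumption: a term is "spelled correctly" if the checker knows it
        correct = term.lower() in known_words
        if omit_misspelled and not correct:
            return False
        if omit_spelled_correctly and correct:
            return False
    if omit_below is not None and freq < omit_below:  # "less than X"
        return False
    if omit_above is not None and freq > omit_above:  # "greater than X"
        return False
    return True

def filter_terms(frequencies, **options):
    """Apply keep_term to a {term: frequency} mapping."""
    return {t: f for t, f in frequencies.items() if keep_term(t, f, **options)}

# Hypothetical term frequencies
freqs = {"the": 40, "valve": 7, "vlave": 1, "2019": 3, "leak": 12, "gasket": 2}

# Popular terms: drop numbers, stopwords, and anything with frequency below 5
popular = filter_terms(freqs, omit_numbers=True,
                       stopwords=STANDARD_STOPWORDS, omit_below=5)
print(popular)  # {'valve': 7, 'leak': 12}

# Terms flagged as misspelled by a small hypothetical dictionary
flagged = filter_terms(freqs, omit_numbers=True,
                       known_words={"the", "valve", "leak"},
                       omit_spelled_correctly=True)
print(flagged)  # {'vlave': 1, 'gasket': 2}
```

In the second call, "gasket" is spelled correctly but is still flagged because the hypothetical dictionary does not contain it, which mirrors the note about the spell checker.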
X in (ID)
X represents the number of times the term occurs in the document.
ID represents the name of the document.
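A per-document entry of this form can be derived as in the sketch below. It assumes whitespace tokenization and lowercasing; the function name document_entries and the document names are hypothetical.

```python
from collections import Counter

def document_entries(term, documents):
    """Return "X in (ID)" entries for every document containing the term.

    Assumption: `documents` maps a document name (ID) to its text.
    """
    entries = []
    for doc_id, text in documents.items():
        count = Counter(text.lower().split())[term.lower()]
        if count:  # list only documents that actually contain the term
            entries.append(f"{count} in ({doc_id})")
    return entries

# Hypothetical sample documents
docs = {"report_a": "valve leak valve", "report_b": "pump leak"}
print(document_entries("valve", docs))  # ['2 in (report_a)']
print(document_entries("leak", docs))   # ['1 in (report_a)', '1 in (report_b)']
```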
Each term lists the documents that contain it. Under the content, the associated term is highlighted in blue text.
Note: Search Overlay is independent of the associated term.