Difference between revisions of "Appendix"

From TED Notepad
Line 18: Line 18:
  
  
*A {{definition|word}} is a non-empty sequence of {{definition|alphanums}}. Underscores may be optionally included<sup>**</sup> within words and a phrase {{string|hello_world}} is then also treated as a single {{definition|word}} within all Tools and Functions. All characters that such a {{definition|word}} can consist of are called {{definition|word letters}} or also {{definition|word characters}}. Other characters are considered to be {{definition|word delimiters}}
+
* A {{definition|word}} is a non-empty sequence of {{definition|alphanums}}. Underscores may optionally be included within words<sup>**</sup>, a phrase like {{string|hello_world}} is then also treated as a single {{definition|word}}. All characters a {{definition|word}} can consist of are called {{definition|word letters}} or {{definition|word characters}}. Other characters are called {{definition|word delimiters}} or {{definition|non-word characters}}.
  
*A {{definition|line}} is a sequence of characters, where two {{definition|lines}} are divided by a {{string|CR/NL}} sequence of characters. Note, that if {{feature|Word Wrap}} is turned on, a {{definition|line}} may be wrapped, but within all tools it will be still treated as a single {{definition|line}}. Also note, that a single {{string|NL}} or {{string|CR}} character does not divide two {{definition|lines}}.
+
* A {{definition|line}} is a sequence of characters, where two {{definition|lines}} are divided by one newline. Note that if {{feature|Word Wrap}} is turned on, a {{definition|line}} may be visually wrapped into several visual lines, but within all tools and most features it will still be treated as a single unbroken {{definition|line}}. Any current visual word-wrapping has seldom impact on how {{definition|lines}} are treated within tools and features .
  
*An {{definition|empty line}} is a {{definition|line}}, that consists only of {{definition|white-spaces}}. Therefore a {{definition|non-empty line}} is a {{definition|line}}, that contains at least one {{definition|graph}} character.
+
* An {{definition|empty line}} is a {{definition|line}}, which consists of {{definition|white-spaces}} only. Therefore a {{definition|non-empty line}} is a {{definition|line}}, which contains at least one {{definition|graph}} character. Please note that there might be many {{definition|white-spaces}} and still the {{definition|line}} would be considered {{definition|empty}}.
  
*A {{definition|paragraph}} is a sequence of {{definition|non-empty lines}}. Two {{definition|paragraphs}} are then divided by a non-empty sequence of {{definition|empty lines}}.
+
* A {{definition|paragraph}} is a sequence of {{definition|non-empty lines}}. Two {{definition|paragraphs}} are divided by a sequence of {{definition|empty lines}}. There is no such thing as empty {{definition|paragraph}}, since sequences of {{definition|empty lines}} are always grouped together when determining {{definition|paragraphs}}.
 +
 
 +
* A {{definition|sentence}} is a sequence of characters that begins with a {{definition|capital}} and ends with a Dot, a Question mark or an Exclamation mark. Example: {{string|Alice? Who the f... is Alice?}} are two {{definition|sentences}}, but {{string|Alice? Who the f... Is Alice?}} are three {{definition|sentences}}. Unfortunatelly, even {{string|How are you today, Mr. President?}} is considered as two {{definition|sentences}}.
  
*A {{definition|sentence}} is a sequence of characters that begins with a {{definition|capital}} and ends with a Dot, a Question mark or an Exclamation mark. Example: {{string|Alice? Who the f... is Alice?}} are two {{definition|sentences}}, but {{string|Alice? Who the f... Is Alice?}} are three {{definition|sentences}}. Unfortunatelly, even {{string|How are you today, Mr. President?}} are two {{definition|sentences}}, which is not very fortunate.
 
  
 
*A {{definition|column}} is a sequence of characters on a {{definition|line}}. Two {{definition|columns}} are divided by any of the {{definition|column delimiters}}. A {{definition|column}} can not exceed a {{definition|line}}. Typically, when a {{definition|line}} is divided into logical parts by a special {{definition|delimiter}} character (e.g. a Tab character), those parts are called {{definition|columns}}. {{definition|Columns}} are used to cut out a sub-{{definition|string}} from a {{definition|line}}.
 
*A {{definition|column}} is a sequence of characters on a {{definition|line}}. Two {{definition|columns}} are divided by any of the {{definition|column delimiters}}. A {{definition|column}} can not exceed a {{definition|line}}. Typically, when a {{definition|line}} is divided into logical parts by a special {{definition|delimiter}} character (e.g. a Tab character), those parts are called {{definition|columns}}. {{definition|Columns}} are used to cut out a sub-{{definition|string}} from a {{definition|line}}.

Revision as of 16:58, 11 November 2013

You see work in progress here; this section already reflects future TED Notepad version 5.4.1.1.
This section may contain incomplete, premature, or mistaken information, prone to change without notice.

The meaning of some terms used in this manual is as follows below:


  • A white-space is a Space or a Tab or another character that can not be seen but provides blank visual separator in the document. All other characters which can be seen, are called graphs.
  • An alphanum is an alpha-numeric character, i.e. a, b, ..., z; A, B, ..., Z; 0, 1, ..., 9.
    • Special characters like á (a with acute) belong to alphanums only in certain locale settings. To be able to recognize these characters as alphanums you need to use CTYPE category of a locale that supports it. TED Notepad always works with the current system locale settings.
  • A capital is any capital letter, i.e. A, B, ..., Z. These are called letters in upper letter case or simply upper case letters. Their oposites are called lower case letters and are in lower letter case or simply in lower case.
    • Special characters like á (a with acute) belong to capitals only in certain locale settings. To be able to recognize these characters as capitals you need to use CTYPE category of a locale that supports it. TED Notepad always works with the current system locale settings.



  • A line is a sequence of characters, where two lines are divided by one newline. Note that if Word Wrap is turned on, a line may be visually wrapped into several visual lines, but within all tools and most features it will still be treated as a single unbroken line. Any current visual word-wrapping has seldom impact on how lines are treated within tools and features .
  • A sentence is a sequence of characters that begins with a capital and ends with a Dot, a Question mark or an Exclamation mark. Example: Alice? Who the f... is Alice? are two sentences, but Alice? Who the f... Is Alice? are three sentences. Unfortunatelly, even How are you today, Mr. President? is considered as two sentences.


  • A char range is a sub-sequence of characters that begins and ends at the specified positions. Char range is used to cut out a sub-string from a longer column.
  • An actual insertion point (also called a cursor position) is a position of the caret in the documnet or the end of the actual selection, if any. Note, that in special cases, it is the beginning of the selection, if any. These special cases are tools/features that work backward. (e.g. Find Previous or BkSpace Word.)
  • To unique lines is to remove duplicate lines, to unify them. If lines or words are uniqued, it means that each line (word) is unique and there no two lines are of the same text.


**: See section General page of the Settings dialog.