Appendix
From TED Notepad
(Redirected from Control:Definition:white-space character)
This section is up to date for TED Notepad version 6.3.1.0.
The meaning of some terms used in this manual is as follows below:
- A
white-space
is a Space or a Tab or another character that can not be seen but provides blank visual separator in the document. All other characters which can be seen, are calledgraphs
. Ablank character
is also a character that can not be seen but provides blank visual separator in the document. Allwhite-spaces
areblank characters
, but somecontrol characters
areblank characters
as well.
- An
alphanum
is analpha-numeric character
, i.e. a, b, ..., z; A, B, ..., Z; 0, 1, ..., 9.
- A
digit
is any digit recognized by Unicode, i.e. 1, ..., 9, but also ¹, ², ³, etc.
- A
capital
is any capital letter, i.e. A, B, ..., Z. These are called letters inupper letter case
or simplyupper case
letters. Their oposites are calledlower case
letters and are inlower letter case
or simply inlower case
.
- Other types of
character case
includeword capitals
, where eachword
begins with acapital
and continues withlower case
letters;first capital
, where the first letter is acapital
and all others arelower case
letters; andmixed case
, where none of the aboveletter cases
can be determined.
- To
ignore case
is to ignore differences betweenletter cases
likecapitals
andlower case
letters. Whenignoring case
, letter a is equal to letter A, b equal to B, etc. An antonym ofignore case
is tomatch case
and an operation, thatmatches case
iscase sensitive
.
- To
mimic character case
is to try to altercharacter case
of some text based oncharacter case
of the original. Currently only basic types ofcharacter case
are recognized:lower case
,upper case
,word capitals
,first capital
. Everyting else is consideredmixed case
.
- There are also other types of characters recognized by TED notepad:
- A
punctuation character
is any character recognized by Unicode as meant for punctuation purposes, e.g. quotation marks. - A
control character
is a character from the very beginning of the ASCII table. These have special meaning and should be either avoided or treated with care.
- A
- A
string
is a sequence of characters. Typically, suchstring
is used as a synonym for a phrase that a user have entered in a dialog. E.g. Find and Replacestrings
from Search and Replace dialog are always used in find/replace mechanisms.
- A
word
is a non-empty sequence ofalphanums
. Underscores may optionally be included within words, a phrase like hello_world is then also treated as a singleword
. All characters aword
can consist of are calledword letters
orword characters
. Other characters are calledword delimiters
ornon-word characters
. See section General page of the Settings dialog for more information about Underscores in words.
- A
word boundary
is aword
beginning orword
end. This is the place where one of the characters around is aword character
and the other is either anon-word character
or there is no character at all.
- A
line
is a sequence of characters, where twolines
are divided by onenewline
. Note that if Word Wrap is turned on, aline
may be visually wrapped into several visual lines, but within all tools and most features it will still be treated as a single unbrokenline
. Any current visual word-wrapping has seldom impact on howlines
are treated within tools and features .
- An
empty line
is aline
, which consists ofwhite-spaces
only. Therefore anon-empty line
is aline
, which contains at least onegraph
character. Please note that there might be manywhite-spaces
and still theline
would be consideredempty
.
- A
paragraph
is a sequence ofnon-empty lines
. Twoparagraphs
are divided by a sequence ofempty lines
. There is no such thing as emptyparagraph
, since sequences ofempty lines
are always grouped together when determiningparagraphs
.
- A
sentence
is a sequence of characters that begins with acapital
and ends with a Dot, a Question mark or an Exclamation mark. Example: Alice? Who the f... is Alice? are twosentences
, but Alice? Who the f... Is Alice? are threesentences
. Unfortunatelly, even How are you today, Mr. President? is considered as twosentences
.
- A
line column
is part ofline
, which meets certaincolumn criteria
. As these column criteria are applied to subsequentlines
, they determine a logicalcolumn
of text over these lines. The criteria are applied to individuallines
independently, thus possibly determining a column of text, which may be visually hard to identify. Nevertheless, for each individualline
, the column criteria are met.- Note: A
line column
is always one solid line portion, i.e. one line column can never consist of two separate portions of the same line. This is because aline column
is a logical part ofline
, it only specifies where it begins and where it ends on each line. - Applicable column criteria may change from feature to feature and from tool to tool, but they usually include:
- Dividing each
line
into portions usingdelimiting characters
, also calleddelimiters
. These delimiting characters are located within each line and the line is split into portions. A splitting point occurs at any of these characters. These portions are numbered. The criteria then specify, which consequtive portions are to be selected for the line column. Note: Delimiting characters enclosing the selected portions are not included within the line column, but any delimiting characters between the selected portions are naturally included. - Dividing each
line
into portions using adelimiting phrase
. In contrast to thedelimiting characters
, delimiting phrase is always located within each line as a whole sequence of characters, not as a set of individual and interchangeable characters. The line is split into numbered portions wherever this whole delimiting phrase is found. The criteria then specify, which consequtive portions are to be selected for the line column. Note: Delimiting phrases enclosing the selected portions are not included within the line column, but any delimiting phrases between the selected portions are naturally included. - Taking only a portion of each
line
based on arange of characters
. A range of characters is simply a starting and an ending point within the line. All characters between the starting and ending points are selected for the line column. - Certain combinations of the above criteria can be used to further reduce the column. For example, a set of
delimiting characters
can be used to split the line and select only the second part, and then arange of characters
can be used to further reduce that part at its beginning and/or at its end. Note that these criteria are applied in sequence and their results compound, i.e. later criteria obey prior criteria and never try to reach outside of boundaries set by preceding criteria.
- Dividing each
- Note: A
- An
actual insertion point
(also called acurrent caret location
) is a position of the caret in the documnet. It is also the end of the actual selection, if any. Note that the end of the selection is where the user stops selecting the text, therefore if selecting text upwards, the selection end visually preceeds the selection beginning.