From TED Notepad
(Redirected from Control:Definition:word boundary)
This section is up to date for TED Notepad version 188.8.131.52.
The meaning of some terms used in this manual is as follows below:
white-spaceis a Space or a Tab or another character that can not be seen but provides blank visual separator in the document. All other characters which can be seen, are called
blank characteris also a character that can not be seen but provides blank visual separator in the document. All
blank characters, but some
blank charactersas well.
alpha-numeric character, i.e. a, b, ..., z; A, B, ..., Z; 0, 1, ..., 9.
digitis any digit recognized by Unicode, i.e. 1, ..., 9, but also ¹, ², ³, etc.
capitalis any capital letter, i.e. A, B, ..., Z. These are called letters in
upper letter caseor simply
upper caseletters. Their oposites are called
lower caseletters and are in
lower letter caseor simply in
- Other types of
word capitals, where each
wordbegins with a
capitaland continues with
first capital, where the first letter is a
capitaland all others are
lower caseletters; and
mixed case, where none of the above
letter casescan be determined.
ignore caseis to ignore differences between
lower caseletters. When
ignoring case, letter a is equal to letter A, b equal to B, etc. An antonym of
ignore caseis to
match caseand an operation, that
mimic character caseis to try to alter
character caseof some text based on
character caseof the original. Currently only basic types of
character caseare recognized:
first capital. Everyting else is considered
- There are also other types of characters recognized by TED notepad:
punctuation characteris any character recognized by Unicode as meant for punctuation purposes, e.g. quotation marks.
control characteris a character from the very beginning of the ASCII table. These have special meaning and should be either avoided or treated with care.
stringis a sequence of characters. Typically, such
stringis used as a synonym for a phrase that a user have entered in a dialog. E.g. Find and Replace
stringsfrom Search and Replace dialog are always used in find/replace mechanisms.
wordis a non-empty sequence of
alphanums. Underscores may optionally be included within words, a phrase like hello_world is then also treated as a single
word. All characters a
wordcan consist of are called
word characters. Other characters are called
non-word characters. See section General page of the Settings dialog for more information about Underscores in words.
word boundaryis a
wordend. This is the place where one of the characters around is a
word characterand the other is either a
non-word characteror there is no character at all.
lineis a sequence of characters, where two
linesare divided by one
newline. Note that if Word Wrap is turned on, a
linemay be visually wrapped into several visual lines, but within all tools and most features it will still be treated as a single unbroken
line. Any current visual word-wrapping has seldom impact on how
linesare treated within tools and features .
empty lineis a
line, which consists of
white-spacesonly. Therefore a
non-empty lineis a
line, which contains at least one
graphcharacter. Please note that there might be many
white-spacesand still the
linewould be considered
paragraphis a sequence of
non-empty lines. Two
paragraphsare divided by a sequence of
empty lines. There is no such thing as empty
paragraph, since sequences of
empty linesare always grouped together when determining
sentenceis a sequence of characters that begins with a
capitaland ends with a Dot, a Question mark or an Exclamation mark. Example: Alice? Who the f... is Alice? are two
sentences, but Alice? Who the f... Is Alice? are three
sentences. Unfortunatelly, even How are you today, Mr. President? is considered as two
line columnis part of
line, which meets certain
column criteria. As these column criteria are applied to subsequent
lines, they determine a logical
columnof text over these lines. The criteria are applied to individual
linesindependently, thus possibly determining a column of text, which may be visually hard to identify. Nevertheless, for each individual
line, the column criteria are met.
- Note: A
line columnis always one solid line portion, i.e. one line column can never consist of two separate portions of the same line. This is because a
line columnis a logical part of
line, it only specifies where it begins and where it ends on each line.
- Applicable column criteria may change from feature to feature and from tool to tool, but they usually include:
- Dividing each
lineinto portions using
delimiting characters, also called
delimiters. These delimiting characters are located within each line and the line is split into portions. A splitting point occurs at any of these characters. These portions are numbered. The criteria then specify, which consequtive portions are to be selected for the line column. Note: Delimiting characters enclosing the selected portions are not included within the line column, but any delimiting characters between the selected portions are naturally included.
- Dividing each
lineinto portions using a
delimiting phrase. In contrast to the
delimiting characters, delimiting phrase is always located within each line as a whole sequence of characters, not as a set of individual and interchangeable characters. The line is split into numbered portions wherever this whole delimiting phrase is found. The criteria then specify, which consequtive portions are to be selected for the line column. Note: Delimiting phrases enclosing the selected portions are not included within the line column, but any delimiting phrases between the selected portions are naturally included.
- Taking only a portion of each
linebased on a
range of characters. A range of characters is simply a starting and an ending point within the line. All characters between the starting and ending points are selected for the line column.
- Certain combinations of the above criteria can be used to further reduce the column. For example, a set of
delimiting characterscan be used to split the line and select only the second part, and then a
range of characterscan be used to further reduce that part at its beginning and/or at its end. Note that these criteria are applied in sequence and their results compound, i.e. later criteria obey prior criteria and never try to reach outside of boundaries set by preceding criteria.
- Dividing each
- Note: A
actual insertion point(also called a
current caret location) is a position of the caret in the documnet. It is also the end of the actual selection, if any. Note that the end of the selection is where the user stops selecting the text, therefore if selecting text upwards, the selection end visually preceeds the selection beginning.