The vi/ex Editor, Part 2: Line-Mode Addresses

	Whenever you want to give an editor command that will operate on
	text that's already in the file you're editing--to delete some
	text, change lower-case letters to capitals, write to a file,
	etcetera--the editor needs to know what part of the file to go to
	work on. A few commands have their addresses built in, and most
	line-mode commands have default addresses that the editor will use
	if you don't give an address, but that still leaves a lot of
	occasions where you need to know how to give the editor an address
	and what address to give.

	Many line-mode commands are almost identical to corresponding
	commands in visual mode; many more do similar things in different
	ways. Most of the benefit of these duplicative command sets comes
	from the totally-different addressing styles of line and visual
	modes. The differing address concepts mean that an edit that
	would be difficult or impossible to do with one mode's available
	addresses can be a piece of cake with an address form found in the
	other mode.

	Since I mention "line mode" so often, you may wonder whether there
	really is a separate mode for line editing. There surely
	is--instead of filling your screen with text from the file you're
	editing, this mode gives you a colon (:) prompt for your line mode
	commands, and prints only an occasional line from the file on your
	screen. The feel of this mode is very much like giving UNIX
	commands from your shell prompt. Few people work in line mode
	these days, largely because you can give most line-mode commands
	from visual mode, but you can't give any visual-mode commands
	while you are in line mode. Or perhaps they just prefer the
	comfortable WYSIWYG feeling of seeing the text on screen, with
	changes appearing as they are made.

	But there are times when you will need to work temporarily in line
	mode. To get to line mode when you first launch the editor,
	invoke it by typing "ex" instead of "vi". To go to line mode when
	you are already in the editor's visual mode, enter "Q". To get
	back to visual mode, type "vi" followed by a carriage return.

	Wondering why I didn't put a colon in front of that command to
	return to visual mode, which is obviously a line-mode command?
	Because you don't need to type that colon when you're giving a
	command from within line mode. It may even be harmful; the rule
	is that if you type a colon at the start of a command from within
	line mode, there must be nothing between the colon and the command
	name or abbreviation. Not an address, not even a space, nothing
	at all.

	So from this point on, I will display line-mode commands without
	an initial colon, because you now know enough to type that colon
	only if you're working in visual mode. And I'll leave off the tag
	at the end of a line-mode command that reminds you to finish with
	a carriage return because you now realize that any line-mode
	command, given from either line or visual mode, has to end with a
	carriage return.

	Some of you may ask why I show line-mode command lines in
	long-winded form, with spelled-out command names and lots of
	whitespace instead of using abbreviations. For instance, the two
	command lines:

	global /^/ move 0
	g/^/m0

	are identical in their effect, and the second is surely faster to
	type, so why do I show the first form? Because the long version
	is much easier to follow when I'm demonstrating a new concept, and
	almost everything here will be new to at least some of you. And
	it's a good idea to get to know the long forms, because you'll
	soon be learning to write editor scripts, and those scripts would
	be as cryptic as APL to future maintenance programmers if you
	wrote them in terse style. When I go over the roster of line-mode
	commands, I'll tell you both the long name and one or two short
	names for each.

	Line-Mode Addressing

	A SINGLE ADDRESS is often all you need with a line-mode command.
	One address refers to just one line, which tells a command like
	delete or substitute to operate on that one line only. A command
	like insert or read, which puts something immediately before or
	after a particular line, has no use for more than one address.

	A search pattern, as discussed in the first installment of this
	tutorial, is always an acceptable line-mode address. You put the
	address at the start of the command line, before the command name
	(but after the initial colon if you are giving the command from
	visual mode), so:

	?the [cC]at? delete

	will erase the last previous line that contains the string "the
	cat" or "the Cat", while:

	/^GLOSSARY$/ read gloss.book

	puts the contents of the file "gloss.book" right after the next
	line in the file you're editing that contains only the word
	"GLOSSARY".

	There are two shorthand forms for reusing search patterns as
	addresses. Typing "??" or "//" tells the editor to use the last
	search pattern you used previously, and your choice of "??" or
	"//" will set the direction of the search, overriding the
	direction you chose the previous time you used that search
	pattern. That is, if you type:

	?the cat? yank
	// delete
	?? print

	the second command will search forward, to remove the next line
	containing the string "the cat", even though your
	original use of that pattern was in a backward search. The third
	command will search backward to find the line to print, which (by
	coincidence) is the direction of the original search.
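
	The way one remembered pattern can be searched in either direction
	can be sketched in Python. This is only an illustration, not the
	editor's own code: the function name search and the sample lines
	are made up, Python's regular-expression syntax differs somewhat
	from the editor's, and the sketch assumes that searches wrap
	around the ends of the file. For simplicity it keeps the current
	line fixed at line 3 rather than moving it after each command.

```python
import re

def search(lines, current, pattern, backward=False):
    """Return the 1-based number of the nearest matching line.

    The scan begins next to `current`, wraps around the end (or start)
    of the buffer, and checks `current` itself last.
    """
    total = len(lines)
    direction = -1 if backward else 1
    for distance in range(1, total + 1):
        number = (current - 1 + direction * distance) % total + 1
        if re.search(pattern, lines[number - 1]):
            return number
    return None  # no line matches

# Hypothetical five-line buffer; the current line is line 3.
text = ["the cat naps", "a dog", "you are here", "the cat eats", "the end"]
last_pattern = r"the cat"

found_backward = search(text, 3, last_pattern, backward=True)  # like ?the cat? or ??
found_forward = search(text, 3, last_pattern)                  # like //
print(found_backward, found_forward)  # 1 4
```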

	But the search pattern that those preceding abbreviations reuse
	may not be a pattern you used to search for a line. If you ran a
	substitute command after any pattern searches for lines, then the
	pattern you gave the substitute command to tell it what text to
	take out of the line is the pattern that will be reused. This is
	so even if your substitute command began with a search pattern to
	specify the line on which the substitution was to be
	performed--the search to find the pattern to be replaced within
	the line was run after the first search pattern had found the line
	to operate on, so the search within the line was the last pattern
	search run. So if you were to type:

	/the cat/ substitute /in the hat/on the mat
	?? delete

	the second command would, in this case, delete the last previous
	line containing "in the hat". To be sure that the pattern that
	gets reused is the last one used to find a line, use the
	abbreviations "\?" and "\/" to search backward and forward,
	respectively. In all other respects these work just as typing
	"??" and "//" do.

	A LINE NUMBER is also a valid line-mode address. The editor
	automatically numbers each line in the file consecutively, and
	this numbering is dynamic--that is, whenever you add or delete
	lines somewhere, the editor renumbers all the lines following the
	insertion or deletion point. So if you change some text on line
	46 in your file, and then delete lines 11 and 12, the line with
	the text you changed is now line 44. And if you then add ten new
	lines after line 17, the line with your changed text on it now
	automatically becomes line 54.
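
	The renumbering arithmetic in that example can be checked with a
	small Python sketch. It models the buffer as an ordinary list,
	which is an assumption made for illustration and says nothing
	about how the editor really stores its lines.

```python
# Build a 60-line buffer; each entry records its original line number.
buffer = [f"original line {n}" for n in range(1, 61)]

def line_number_of(buffer, text):
    """Return the current 1-based line number of the line holding `text`."""
    return buffer.index(text) + 1

changed = "original line 46"
before = line_number_of(buffer, changed)

# Delete lines 11 and 12 (list indexes 10 and 11).
del buffer[10:12]
after_delete = line_number_of(buffer, changed)

# Add ten new lines after line 17 (that is, starting at list index 17).
buffer[17:17] = [f"new line {n}" for n in range(1, 11)]
after_insert = line_number_of(buffer, changed)

print(before, after_delete, after_insert)  # 46 44 54
```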

	There is never a gap or an overlap in the line number sequence, so
	the nth line in the file is always line number n; that is, the 7th
	line is always line number 7, and so on. (There are several ways
	to display these line numbers, which I will expound in a later
	tutorial installment.) To delete the 153rd line in your file,
	just type:

	153 delete

	You don't use any delimiters around a line number, or around any
	other address except a search pattern.

	There are two symbolic line numbers and one fictional one that can
	be used in line-mode addresses. As long as there are any lines in
	the buffer (that is, you haven't specified a not-yet-existent file
	to edit and failed to enter any text so far), the editor regards
	you as being `on' one of them, usually the last line affected by
	your latest command. A period or dot (.) is the symbolic address
	for this line. The last line in the file also has a symbolic
	address: the dollar sign ($). So if you should type:

	. write >> goodlines
	$ delete

	the first command would append a copy of just the line you are on
	now to a file named "goodlines", while the second would delete the
	last line in the file you are editing.

	A few commands put text immediately after the line address you
	give: the append command is one of them. In order to let them put
	their text at the very start of a file (if that is where you want
	it), these commands can take the fictitious line number zero (0)
	as their address. So, if you want to type some text that will
	appear ahead of anything already in the file, you can do it with
	either of these command lines:

	1 insert
	0 append

	(Note, though, that insert and append are among the few line-mode
	commands that cannot be run from visual mode by starting with a
	colon, because they occupy more than one line including the text
	to be put in.)

	WRITING YOUR OWN LINE ADDRESSES is possible, too. You can attach
	lower-case letters to lines as line addresses, and change the
	attachments whenever you like. You can even use a special address
	that is automatically attached to the last line you jumped off
	from.

	There are ways to mark a particular line with a lower-case letter
	of the alphabet, and those ways differ between line and visual
	modes. I'll be explaining all these ways in later installments of
	this tutorial. But once a line is marked, the line-mode address
	that refers to that line is just the single-quote character
	followed immediately by the lower-case letter with which the line
	was marked. So typing:

	'b print

	will display on the screen whatever line you have previously
	marked with the letter b, no matter where the line is in relation
	to where you are when you give the command. No need to tell the
	editor whether to search forward or backward; there can be only
	one line at a time marked with any one letter, and the editor will
	find that line regardless.

	The editor does some line marking on its own, too. Whenever you
	move from one line to another by a non-relative address, the
	editor marks the line you just left. (A non-relative address is
	one that isn't a known number of lines from where you were.) So:

	$
	/the cat/
	358
	?glossary? +7
	'b

	are all non-relative addresses, and if you give any one of them,
	the editor will mark the line you are leaving for future
	reference. Then you can return to that line just by typing two
	successive single quotes:

	''

	as a line-mode address. In theory, you can use this address with
	any line-mode command. But it is so difficult to know for sure
	when you left a line via a non-relative address that this address
	form is best saved for going back to where you were when a mistake
	moves you far away, at least until you're a wizard with this
	editor.

	MODIFYING ANY OF THESE ADDRESSES is possible, and there are two
	ways to do this. The simpler way is to offset the address a
	certain number of lines forward or backward with plus (+) or minus
	(-) signs. The rule is that each plus sign following an address
	tells the editor to go one line farther forward in the file than
	the basic address, while each minus sign means a line backward.
	So these three addresses all refer to the same line:

	35
	37 --
	30 +++++

	Not that you're likely to want to modify line-number addresses
	with counts, unless you're weak in arithmetic and want the editor
	to do the adding and subtracting for you. But the count offsets
	will work with any line-mode addresses, and are most often used
	with search patterns. In any event, there is a shorthand for
	these counts, too. A plus or minus sign immediately followed by a
	number (single or multiple digits) is equivalent to a string of
	plus or minus signs equal to that number, so that these two
	addresses are the same:

	/^register long/ ++++
	/^register long/ +4

	Take note that the "4" in the second example does not mean "line
	number 4", as it would if it appeared by itself as an address.
	After a plus or minus sign, a number is a count forward or
	backward from where the primary address lands (or if there is no
	primary address before the count, from the line you are on when
	you run the command).

	Note also that this is one of the few places in line-mode commands
	where you may not insert a blank space. The number must start in
	the very next character position after the plus or minus sign. If
	you violate this rule, the editor will uncomplainingly operate on
	some line that definitely is not the line you expected.
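
	The offset rule can be restated as a short Python sketch. The
	function name net_offset is hypothetical, and the sketch covers
	only well-formed offsets, where any number follows its sign
	immediately; it does not try to imitate what the editor does when
	a space is slipped in.

```python
import re

def net_offset(offset_text):
    """Convert an offset such as '--', '+++++', or '+4' into a signed count."""
    total = 0
    # Each token is one sign, optionally followed at once by digits.
    for sign, digits in re.findall(r"([+-])(\d*)", offset_text):
        count = int(digits) if digits else 1
        total += count if sign == "+" else -count
    return total

print(37 + net_offset("--"))      # 35
print(30 + net_offset("+++++"))   # 35
print(net_offset("++++") == net_offset("+4"))  # True
```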

	The second style of address modifier is used where you want to do
	a search that's complex. Let's say you want to go forward in the
	file to delete a line that starts with "WARNING!", but not the
	first such line the editor would encounter; you want the second
	instance. Either of these command lines will do it:

	/^WARNING!/ ; /^WARNING!/ delete
	/^WARNING!/ ; // delete

	A semicolon (;) between two search patterns tells the editor to
	find the location of the first pattern in the usual way, then
	start searching from that location for the second pattern. In
	this case, the first search pattern turned up the first instance
	of a line starting with "WARNING!", and the second search pattern
	led the editor on to the second instance.

	A very significant point here is that this combination of two
	search patterns, either of which could be a line address in
	itself, does not tell the editor to delete two lines. The
	semicolon means that the first pattern is merely a way station,
	and that the single line found by the second search pattern is the
	only line to be deleted. In brief, what looks like addresses for
	two lines is actually only an address for one. (This is not what
	the official documentation for this editor says, but the
	documentation is just plain wrong on this point.)
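
	Here is a minimal Python sketch of the way-station idea, using
	made-up function names (find and resolve_chain) and a made-up
	buffer. It handles forward searches only and assumes wraparound.
	The point it illustrates is that the whole chain yields one line
	number, not two.

```python
import re

def find(lines, start, pattern):
    """Return the first line after `start` (wrapping around) that matches."""
    total = len(lines)
    for distance in range(1, total + 1):
        number = (start - 1 + distance) % total + 1
        if re.search(pattern, lines[number - 1]):
            return number
    raise ValueError("pattern not found")

def resolve_chain(lines, current, patterns):
    """Resolve forward searches separated by semicolons to ONE line number."""
    position = current
    for pattern in patterns:
        # Each search starts from wherever the previous one landed.
        position = find(lines, position, pattern)
    return position

text = ["intro", "WARNING! first", "body", "WARNING! second", "end"]
target = resolve_chain(text, 1, [r"^WARNING!", r"^WARNING!"])
print(target)  # 4
```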

	But that's just the start of what you can do. You are not
	restricted to just two addresses. I've used up to ten of them,
	all separated by semicolons, to reach one specific line. As an
	example:

	?^Chapter 3$? ; /^Bibliography$/ ; /^Spinoza/ ; /Monads/

	will bring me to the title line of Spinoza's first work with
	"Monads" in the title, in the bibliography for Chapter 3.

	Nor are you limited to search pattern addresses when putting
	together a semicolon-separated address string. If you want to
	reach the first line following line 462 that contains the word
	"union", typing:

	462 ; /\<union\>/

	will bring you there. And any of the addresses can take numerical
	offsets, so:

	462 +137 ; /register int/ ---

	is also a legitimate address string.

	But there are two unfortunate limitations on using
	semicolon-separated address strings. The lesser problem is that
	such a string can use "line zero" as an address only if the
	command following the address string could take line zero by
	itself as its address. That is, you can't even start at line zero
	and then proceed elsewhere with additional addresses, unless the
	command can operate from line zero. So:

	0 ; /Spinoza/ +++ ; /Kant/ delete


	which looks like a reasonable way to be sure your search will
	find the very first "Spinoza" in your file, will actually fail
	with an error message about an illegal address.

	The larger misfortune is that each address in a semicolon-
	separated string must be farther down in the file than the one
	that precedes it. (This means the actual location found, after
	applying any plus-sign or minus-sign offset.) You cannot
	move backward within the series of way points.

	But that does not mean that you cannot use a backward search
	pattern within the string. The first address can be a backward
	search, of course. And a subsequent address can search backward
	if you are certain that the line it finds will actually be more
	forward in the file. For example, you may know that a certain
	backward search will wrap around to the bottom end of the file
	before it finds a match. A common example would be:


	1 ; ?Spinoza? ; /Hegel/ yank

	Beginning a backward search from the first line in the file means
	that the search must start with the last line in the file due to
	wraparound, which guarantees that the search will yank the "Hegel"
	line that follows the very last "Spinoza" line in your file.

	Also, you can use a plus-sign offset after a backward search when
	you are certain that the line finally found after the offset is
	applied will be farther down in the file than the preceding way
	point had been. Thus, if I want to find the first mention of
	Hegel in Chapter 8 that is at least 120 lines after the last
	mention of him in Chapter 7, I can type:

	/^Chapter 8$/ ; ?Hegel? +119 ; //

	If a command with this address fails and gives an error message
	about a bad address, I'll know that the last mention of Hegel in
	Chapter 7 is more than 120 lines before the end of the chapter, so
	the very first mention of his name in Chapter 8 is what I'm
	looking for. In that case, the address:

	/^Chapter 8$/ ; /Hegel/

	is all that my command needs.

	The situation with forward searches inside a semicolon-separated
	address string is a mirror image of what I've just said. A forward
	search can take a minus-sign offset if you know that the offset is
	small enough that the line found will be further down than the
	last way point. But a forward search will fail, even with no
	offset or a plus-sign offset, if wraparound makes it find a line
	earlier in the file than the way point from which it began.

	Addressing a Section of Text

	TWO ADDRESSES CAN ALSO STAND FOR A RANGE OF LINES. When two
	addresses are separated by a comma rather than a semicolon, the
	meaning changes radically. (What a difference a dot makes!)

	Often you will want a line-mode command to act on a series of
	successive lines. For example, you may want to move a stretch of
	text from one place to another. To do this, you give the address
	of the first line you want the command to act on, followed by the
	last line it should act on, and separate the two addresses with a
	comma. So, the command:

	14 , 17 delete

	will delete line 14 and line 15 and line 16 and line 17. You can
	see that putting more than two addresses in a comma-separated
	address string would be pointless. The line mode of this editor
	is discreet if you ignore this and string together three or more
	addresses with comma separation: it uses the first two addresses
	and discards the rest.

	Any line-mode addresses may be used with a comma. All of the
	following combinations make sense:

	'd , /^struct/
	257 , .
	?^Chapter 9$? , $

	The first address combination would cause the command that follows
	it to operate on the section starting with the line you have
	previously marked "d" and ending with the next forward line that
	begins with "struct", inclusive. The second combination covers
	line 257 through the line you are on now. The third goes backward
	to include the previous line containing only "Chapter 9", and
	forward to include the very last line in your file; plus all the
	lines in between, of course.

	There are limitations on this technique, too. The primary one is
	that the address after the comma (after any offsets, of course)
	cannot be earlier in the file than the address before the comma.
	That is, the range of lines must run forward from the first
	address to the second address. So the command:

	57 , 188 delete

	is just fine, while the similar-looking command:

	188 , 57 delete

	will only produce an error message. (But if the two addresses
	happen to evaluate to the same line, there is no problem. The
	command will silently operate on the one line you've specified.)

	As you work up to more sophisticated line-mode addresses, you may
	get unexpected error messages about the second address being prior
	to the first address, when you don't see how you could have
	anticipated that the addresses would evaluate that way. That's no
	disgrace, and the solution is simple. After you've looked over
	the addresses you used, and you're certain that they are the ones
	you want, just type the command in again with the two addresses in
	reverse order. That is, if:

	642 , /in Table 23/ delete

	has failed, giving an error message that the lines are in the
	wrong order, then:

	/in Table 23/ , 642 delete

	will solve that problem.

	The last limitation is that when you use search patterns on both
	sides of a comma, the second search starts from the current line
	just as the first search did; it does not start from the line that
	the first search found. There's a way around that, though, that
	involves using one or more semicolons along with a comma.

	A semicolon-separated address string can be used anywhere in line
	mode that you would use a single address. One very useful
	technique is to use these address strings on one or both sides of
	a comma, to indicate a range of lines to be affected. Remember
	that an address string separated by semicolons is the address of
	just one line, so this one line can be the start or the end of a
	range of lines. For example, in:

	/^INDEX$/ ; /^Xerxes/ , $ write tailfile
	?^PREFACE$? ; /^My 7th point/ , ?^PREFACE$? ; /^In summary/ -- delete

	the first command would write the latter part of the index to a
	new file, while the second could be used to remove a section of a
	book's preface.

	And that brings up the solution to our previous obstacle; the
	second search's starting point. If you want the search after the
	comma to begin from the point the first search found, use the
	first search pattern followed by a semicolon as the start of your
	after-the-comma search string, as in either of:

	?Stradivarius? , ?Stradivarius? ; /Guarnerius/
	?Stradivarius? , ?? ; /Guarnerius/
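
	The difference the extra semicolon makes can be seen in a small
	Python sketch. The search function and the five-line buffer are
	invented for the illustration, and wraparound is assumed. With
	the current line at line 4, a plain search after the comma starts
	from line 4, while the chained form starts from the line the
	backward search found.

```python
import re

def search(lines, current, pattern, backward=False):
    """Return the nearest matching line number, wrapping around the buffer."""
    total = len(lines)
    direction = -1 if backward else 1
    for distance in range(1, total + 1):
        number = (current - 1 + direction * distance) % total + 1
        if re.search(pattern, lines[number - 1]):
            return number
    return None

text = ["Stradivarius", "Guarnerius (near)", "filler",
        "you are here", "Guarnerius (far)"]
current = 4

# Before the comma: ?Stradivarius?
start = search(text, current, "Stradivarius", backward=True)

# After the comma, plain /Guarnerius/ : the search starts at the current line.
plain_end = search(text, current, "Guarnerius")

# After the comma, ?Stradivarius? ; /Guarnerius/ : starts at the first match.
way_point = search(text, current, "Stradivarius", backward=True)
chained_end = search(text, way_point, "Guarnerius")

print((start, plain_end), (start, chained_end))  # (1, 5) (1, 2)
```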

	In view of the rules about not going backward in line-mode address
	strings, I'd better clarify the way these limitations work when
	you combine semicolon and comma separation, as in these two
	examples. All but the first of the way points in each
	semicolon-separated string must be in the forward direction, of
	course, but the start of the second semicolon-separated string may
	be prior to any of the addresses in the first such string, that
	is, the one-way meter resets itself at the comma point. And using
	semicolon-separated strings on both sides of a comma only requires
	that the final landing point of the second semicolon-separated
	string not be earlier in the file than the final landing point of
	the first; the relative locations of the way points don't matter
	to the comma. To clarify this, consider a couple of odd-looking,
	and useless, but very lucid examples. The combination:

	125 ; 176 ; 221 , 32 ; 67 ; 240

	looks invalid due to the backward jump from line 221 to line 32,
	but is actually a perfectly good address. The back jump comes
	right after the comma, where it is all right. But:

	125 ; 176 ; 221 , 32 ; 67 ; 218

	will produce an error message, because the final landing point of
	the first semicolon-separated string, line 221, falls later in the
	file than the final landing point of the second semicolon-
	separated string, line 218.
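
	These rules, as stated above, can be written down as a Python
	sketch that works on plain line numbers only. The names
	landing_point and resolve_range are hypothetical, and the sketch
	assumes that a way point equal to the one before it is acceptable;
	the text above does not settle that case.

```python
def landing_point(chain):
    """Return the final line of a semicolon-separated chain of line numbers.

    Raises ValueError if any way point moves backward, following the
    one-way rule described in the text.
    """
    for earlier, later in zip(chain, chain[1:]):
        if later < earlier:
            raise ValueError(f"way point {later} is before {earlier}")
    return chain[-1]

def resolve_range(first_chain, second_chain):
    """Return (start, end) for two chains joined by a comma."""
    start = landing_point(first_chain)
    end = landing_point(second_chain)  # the one-way meter resets here
    if end < start:
        raise ValueError(f"range end {end} is before range start {start}")
    return start, end

print(resolve_range([125, 176, 221], [32, 67, 240]))  # (221, 240)

try:
    resolve_range([125, 176, 221], [32, 67, 218])
except ValueError as error:
    print("error:", error)  # error: range end 218 is before range start 221
```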

	Now, a note about default addresses. I've already mentioned that
	most line-mode commands that can take an address have a "default"
	address built in, which tells the editor where to run the command
	if you don't give an address with it. Each command has its own
	default address, which may be the current line, the current line
	plus the one following, the last line of the file, or the entire
	file.

	The comma separator has default addresses of its own. They are
	the same regardless of what command is being used, and they
	override any command's own default address. If you put a comma
	before a command and don't put an address before the comma, by
	default the address there is the current line. In the same way,
	if you leave out the address after the comma, the default there is
	also the current line. You can even leave out the address in both
	places and use the current-line default in both: that means the
	implied address is "from the current line to the current line",
	which makes the current line the only line the command will
	operate on. So every one of the following command lines:

	. write >> goodlines
	. , . write >> goodlines
	, . write >> goodlines
	. , write >> goodlines
	, write >> goodlines


	will do exactly the same thing: append a copy of just the current
	line in the file you're editing to another file named "goodlines".
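
	The comma defaults described above amount to a very small rule,
	sketched here in Python. The function name comma_range is made
	up, None stands for an address that was left out, and the current
	line is arbitrarily taken to be line 12.

```python
def comma_range(before, after, current):
    """Fill in a missing address on either side of a comma with the current line."""
    start = current if before is None else before
    end = current if after is None else after
    return start, end

current = 12  # hypothetical current line; "." resolves to this number

# The four comma forms: ". , ."   ", ."   ". ,"   and a bare ","
forms = [(current, current), (None, current), (current, None), (None, None)]
results = {comma_range(before, after, current) for before, after in forms}
print(results)  # {(12, 12)}
```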

	Finally, there is one special symbol that represents a
	comma-separated address combination. The percent sign (%) has the
	same meaning as 1,$ as a line-mode address combination. Both
	refer to the entire file.

	Now You Try It

	Before you try the complex aspects of line-mode addresses in
	actual editing situations, here are some problems you can build
	yourself up on. For each problem I've included a solution that
	will work fairly efficiently.

	1. How can you tell the editor to delete the line that holds
	the very last instance of "EXPORT" in your file? The solution
	is straightforward once you know where to start searching.

	2. Suppose you want to delete the very first line in the file
	with "EXPORT" on it, and that just might be line 1. You can't
	start the search from line zero because the delete command
	cannot take line 0 as an address. When you type the address
	string "$ ; /EXPORT/" to use wraparound, you get an error
	message asserting that the search pattern found a line prior to
	the line found by the "$" address that appeared first, which is
	what you'd expect. How can you tell the editor to find and
	delete this line? The solution requires just a bit of
	creativity.

	3. If you use the address "?abc? , /xyz/", it includes the two
	lines the searches (for "abc" and "xyz") find, as well as all
	the lines between them. How would you specify that you want
	the affected lines to go up to, but not include, the lines the
	two searches find? In this case the solution is simpler than
	you might think.

	Solutions

	Coming Up Next

	The next installment of this tutorial will deal with the global
	commands--they're just too much to absorb right after the
	mind-numbing collection of address forms we've just gone through.
	And to give you more scope for using all these address forms, I'll
	also cover line-mode commands themselves, particularly the ones
	that have more capabilities than you suspect.
