Title: Altair Networking Environment

Title: Altair Networking Environment
Date: January 06, 2019
Tags: altair programming
========================================

After getting the WiFi modem working, the next step was to write a more useful
client for using it. I wanted to remove the need to send commands to the modem
directly and allow for some processing of data coming back. I ended up with a
lot more.

The basic idea of this program was to abstract the modem away from the user. I
wanted a simple interface to connect to a host using a specific protocol and
allow for processing and formatting of output. The scope both shrank
considerably and grew massively.

What I ended up with is a very basic modem use-case. I only implemented a
telnet mode because the telnet client is built into the modem firmware. The
program only has to connect, disconnect, and pass input and output from terminal
to modem like the simple modem program used previously[0]. There is no need to
parse any server output or format anything. There are the remnants of my plans
to implement a gopher client which I want to add next. There is also a "raw"
mode which is functionally the same as the simple program from the previous
modem article and is basically the same thing as the terminal client but without
managing the connection.

A not too difficult program turned very complex because supporting multiple
clients and parsing commands has several possible implementations and user
interactivity. I alternated between specifically optimized implementations and
generalized implementations. The program is also highlighting the limitations
of hand assembling and using the switches for programming. But at the same
time, it was also very revealing about some features of assemblers, compilers,
and command interpreters or shells.

Supporting at least telnet and raw clients necessitated a framework supporting
multiple commands. In the days of BASIC environments, using floppy disks or
cassettes, you could load and execute multiple programs. The environment had to
write that program into memory then begin executing it and provide a way to
abort the program and return to the environment. I simulate this feature by
creating a "command table" listing available commands and the address to jump to
in order to begin their execution. This allows for dynamic loading of commands,
or programs, instead of hard coded compares and jumps. I can also add gopher
later without modifying the command processing code. For simplicity, I only
support single character commands.

If you consider more modern shell environments, you can pass parameters on the
command line when executing the command. The executed command will parse and
check that it received the parameters it requires. My implementation of a
parameter parser ended up somewhere in the middle of generalized and specific to
my needs here. It's written to be called like a library function by any client
(think: a proto-getopt). The subroutine does not know what the parameters are
expected to be or how many there are. It is up to the client to call the
subroutine as many times as needed to get the number of parameters it expects.
Since I am only working with networking clients, I only need a hostname to
connect to and a port. In that order. The port can be optional, which is easy
as the last parameter, since the client will support a specific protocol and can
default to that protocol's standard port.

My original implementation of the parser used hard coded memory locations to
copy each parameter into so the client could then read that location to get the
parameter. That, however, meant a lot of memory planning. I needed to find
space in memory to store strings of unknown length and in the general case, an
unknown number of them. I don't know an efficient way to do this. Also, since
the parameters were entered at the commandline already, I was duplicating
existing data which seemed silly. I needed a way to dynamically find the
parameter in the existing commandline string and just return that. Since the
commandline is a single string and each parameter needs to be a null terminated
string on its own, I still needed to traverse the string and terminate it.
Since parameters are going to be space separated, that leaves a convenient spot
for the null. The address of the start of the parameter is returned in the DE
register. I have the client push the address to the stack for storage to allow
subsequent calls to the parser until the end of the command line is found. The
client can then pull the addresses out of the stack and read the values as
whatever it expects. So ignoring the value of the string, return a 0 if we
found at least one character or a 1 if not to indicate the end of the command
line. The client can decide what to do based on the return value. I am copying
the UNIX and C-like implementation that a 0 indicates success instead of a 1.
Why? There is a command to jump if you have a 0 in the zero flag bit. There is
no command to jump if you get a 1. I could jump if not 0, but if I want to
introduce error codes later, I can jump on 0 to skip any compares that determine
what non-zero value was returned. Just like having the stack grow downwards,
returning a 0 on success was pretty much predestined by the hardware.

Instead of tossing the whole program up then walking through it, let's go
through the new pieces and the parts that taught a lesson. I'll also skip over
describing the stuff we've seen in the previous echo and modem programs.

## Command Entry and Execution ##
Command entry is string based like the string echo[1] program. Input is stored
to memory and when 'enter' is sent, we jump to process what we got.

# Process command #

; process command
; Assumes single char commands, looks for char in command table
; goes to address to execute command. A null terminates the table
; and means the command wasn't found.
343 LDA 072 ; command buffer
344 120Q 120
345 002Q 002
346 CPI 376 ; Check if empty string
347 000Q 000
350 JZ 312 ; goto command entry
351 213Q 213
352 000Q 000
353 MVI M 066 ; Terminate string
354 000Q 000
355 LXI H 041 ; EOL string
356 010Q 010
357 002Q 002
360 CALL 315 ; write string to terminal
361 300Q 300
362 000Q 000
363 LXI H 041 ; Set pointer to start of buffer
364 120Q 120
365 002Q 002
366 MOV A,M 176 ; Get first character
367 INX H 043 ; Go past command character
370 MOV B,A 107 ; Save command char to B
371 LXI H 041 ; load command table
372 103Q 103
373 002Q 002
374 CMP M 276 ; If table char == command char
375 JZ 312 ; goto found
376 015Q 015
377 001Q 001
400 MVI A 076 ; Check for end of table
401 000Q 000
402 CMP M 276 ; If table entry == null
403 JZ 312 ; goto command not found
404 023Q 023
405 001Q 001
406 MOV A,B 170 ; Copy command back from B
407 INX H 043 ; move to next table entry
410 INX H 043
411 INX H 043
412 JMP 303 ; goto next command char
413 374Q 374
414 000Q 000
; command found
415 INX H 043 ; Goto low address byte
416 MOV E,M 136 ; Store in E
417 INX H 043 ; Get high address byte
420 MOV D,M 126 ; Store in D
421 XCHG 353 ; Swap DE into HL
422 PCHL 351 ; Execute command at HL address
; command not found
423 LXI H 041 ; Set HL to error
424 013Q 013
425 002Q 002
426 CALL 315 ; Invalid command
427 300Q 300 ; write string to terminal
430 000Q 000
431 JMP 303 ; Get a new command
432 206Q 206 ; goto command init
433 000Q 000

; command table
1103 t 164Q 164 ; telnet
1104 073Q 073
1105 001Q 001
1106 ! 041Q 041 ; raw
1107 250Q 250
1110 001Q 001
1111 \0 000Q 000 ; null at end of table

Initially, command parsing was hard coded to check for each command character
and jumped accordingly but I quickly realized that adding new commands would be
difficult as it would shift all the code. I decided to put each command into a
table with it's subroutine address. With this, I can add any number of commands
and the same command parser would work. Turns out, this solution is similar to
what CP/M used for calling OS sub-routines (like modern day syscalls) and I'll
explore more about that in the future.

I took some shortcuts since command parsing isn't really what I was trying to
accomplish. I only allow single character commands. Makes it easy to compare
and jump and be on to executing fun things. However, I took the long cut of
allowing parameters to be passed with the command. So, targeting a telnet
client, the user can enter 't hostname port' with port being optional and
defaulting to the telnet port 23. Since telnet always needs a host to connect
to, might as well just enter it there. The same will be true for gopher and any
other network client I might write and add here so it's a useful feature.

All this code does is start at the beginning of the command buffer, read the
character and compare it to the command table. If there is no match, skip ahead
in the table past the sub-routine address to the next command character and
compare again. The table is null terminated to make it easy to know when we've
checked all known commands and can print an error.

If the command is found, read the address into DE, so HL can still be used to
point to the table, then swap DE into HL. Then set the program counter to the
address in HL which causes execution to continue at that addess.

If the command is not found, print an error and reset the command buffer and go
back for another one.

# Parse parameters #

; parse parameters
; expects pointer to args in HL
; returns pointer to terminated param in DE
; and return value in B
434 MVI B 006 ; Preload return with 1
435 001Q 001
436 MOV A,M 176 ; Read until first arg
437 INX H 043
440 CPI 376 ; If space
441 040Q 040
442 JZ 312 ; read next letter
443 036Q 036
444 001Q 001
445 DCX H 053 ; Go back to first non-space character
446 MOV D,H 124 ; Copy param pointer
447 MOV E,L 135 ; to DE
450 MOV A,M 176 ; Get character
451 CPI 376 ; If null
452 000Q 000
453 RZ 310 ; return
454 CPI 376 ; If space
455 040Q 040
456 JZ 312 ; goto done
457 067Q 067
460 001Q 001
461 MVI B 006 ; Found a char
462 000Q 000 ; Set return to 0
463 INX H 043 ; Increment args
464 JMP 303 ; Jump to get character
465 050Q 050
466 001Q 001
; done
467 MVI M 066 ; Terminate parameter string
470 000Q 000
471 INX H 043 ; Move past \0
472 RET 311

; telnet
473 LXI H 041 ; Load command buffer
474 120Q 120
475 002Q 002
476 INX H 043 ; move past command char
477 CALL 315 ; Call parse parameters
500 034Q 034
501 001Q 001
502 MOV A,B 170 ; Get return value
503 CPI 376 ; Check if return is 1
504 001Q 001
505 JZ 312 ; Yes, no hostname
506 207Q 207 ; goto telnet error
507 001Q 001
510 PUSH D 325 ; Save hostname address to stack
511 CALL 315 ; call parse params again
512 034Q 034
513 001Q 001
514 MOV A,B 170 ; Get return value
515 CPI 376 ; Check if return is 1
516 001Q 001
517 JNZ 302 ; No, goto connect
520 134Q 134
521 001Q 001
522 INX H 043 ; Skip over null
523 MVI M 066 ; Set port to 23
524 062Q 062
525 INX H 043
526 MVI M 066
527 063Q 063
530 INX H 043
531 MVI M 066 ; and terminate it
532 000Q 000
533 INX D 023 ; Increment param pointer in DE
; connect
534 PUSH D 325 ; Push port address to stack
..

Here is a snippet of the telnet program which uses parse parameters. More
shortcuts. Telnet knows it needs a hostname and that the port can be optional.
Think of it as telnet working with something like the ARGV variable we have in C
today. It starts with the command itself and the parameters follow. So telnet
starts at the beginning of the command buffer and skips ahead a character past
the command itself. The parse parameters sub-routine expects HL to be the
pointer to the current location in the command buffer so subsequent calls will
continue on to the next parameter until the end of the buffer. It returns the
start of the parameter value in DE and a 0 if a value was found or a 1 if not.

Parse parameters starts by setting the return value to 1 in case nothing is
found. Then we grab characters and loop over spaces until a non-space
character. The side effect here is that we don't need a space after the
command, but we need at least one space between parameters. Once we find a
non-space character, move the pointer back to set up for the next read loop.
That means we will read this character twice. I should try to optimize that.
That also puts the pointer in HL at the beginning of the parameter string so we
can save it to DE. Read the character, if we have a null, we're done. If not,
check again for a space, if not we have a parameter value so set the return
value to 0. Check the next characters until space or null. If space, write a
null over the space to terminate the parameter value string and increment the HL
pointer past it. Then return to the caller which might call back for another
parameter.

This process allows a program to call parse parameters until there is a return
value of 0. The program keeps track of the number of parameters it needs and if
it got enough of them. Telnet, in this example, pushes each parameter pointer
to the stack which allows for an arbitrary number of parameters to be found. We
know telnet only needs one parameter and can fill in a default for the second so
we only use a return of 0 to indicate the optional port parameter wasn't passed
and we need to supply a default. That is done by writing it to the command
buffer as if it had been entered by the user and since parse parameters saved
the address it thought might be a parameter value in DE we can use it by
incrementing and pushing it to the stack. If there are more parameters after a
port, they just get ignored.

I initially had parse parameters read from the command buffer and copy values
into new locations but it occurred to me that if the values were already in
memory, as entered by the user, why not just use that? Also managing memory
manually is hard. Where would I put those values? What if there are a lot of
them? We'll dig into more dynamic memory management some day, I hope.

## Memory management ##
Speaking of managing memory, enough iterations happened while writing this
program that I had to map out where things were so when they changed, I would
know what to update. When manually assembling, you also need to manually keep
tack of where everything will be in memory.

000 start up
025 delete
070 interrupt handler
200 reset
206 command init
213 command entry
265 write char to terminal
300 write string to terminal
314 write char to modem
327 write string to modem
343 process command
434 (001 034) parse parameters
467 (001 067) done
473 (001 073) telnet
534 (001 134) connect
607 (001 207) telnet error
650 (001 250) raw mode
675 (001 275) raw terminal handler
720 (001 320) raw modem handler
736 (001 336) abort connection
(754 end of program)

static data
1000 (002 000) hostname error
1010 EOL
1013 invalid command
1035 telnet command prefix
1043 telnet command suffix
1046 gopher command prefix
1053 gopher command suffix
1056 modem echo off
1064 modem echo on
1072 modem disconnect
1077 delete string
(1102 end of static data)

command table
1103 (002 103)
(1111 end of table)

dynamic data
1120 (002 120) command buffer

Each sub-routine's starting address is listed as well as any other jump points
into it. Also all the static data: strings and the command table, and dynamic
storage: in this case, just the command buffer. An assembler will allow you to
create data statements to store numbers and strings without having to worry
about their location.

I also got to learn first hand why the world switched from octal to hexadecimal.
I already knew this, but now I've lived it. As you can see, I have
parentheticals to show the separate high and low bytes in octal. This
illustrates the problem with octal. When you separate the bytes, the values
change. So an address of 434Q, when stored as 2 8 bit bytes becomes 001Q 034Q.
The same address in hexadecimal would be 0x011C which separates out to 0x01 and
0x1C. Much easier to deal with and why hexadecimal as adopted over octal.

This also highlights how bad life can be without an assembler. Every tweak and
change to code changes the memory values after it and all references need to be
updated. An assembler takes care of that for you and allows you to use labels
to reference specific addresses without needing to know the specific address.

## Conclusions ##
The lesson learned from all of this was not how to write an interface for using
the WiFi modem but rather how the development workflow needed to evolve very
quickly to allow more complex programming. This is not a particularly long
program (though the longest I have written for the Altair) but I think I have
already reached the limit of hand assembly as well as manual entry.

The solution to the second problem is the next step in the history of home
computing. As teletypes and terminals became more available, more efficient
program entry was possible. Monitor programs, which were stored in non-volatile
PROM chips allowed interaction with the system using a teletype or terminal and
also storage of boot loaders for other systems like BASIC.

I want to write a custom monitor that will allow me to leverage my modern
"terminal" (a computer with a terminal emulator) to write programs to the Altair
and read them back to store on the computer.

From there we can add assembly-like functionality to make programming easier and
more easily improve our command environment that's been started here.

--------------------------------------------------------------------------------

I'm not going to get into the telnet program here, this article is long enough
and there isn't much more of interest to it. It has some missing features and
it's just been too difficult to work on to continue as is. I already cheated
and wrote it in a text editor on my computer. I knew there would be too much
iteration to do on paper. I'll revisit and share it when we have a better
development process.

[0] https://blog.kagu-tsuchi.com/articles/altair_on_wifi.html
[1] https://blog.kagu-tsuchi.com/articles/8080_IO_string_echo.html