KNS - Ken's Name Server

What it is
Access and Security
The Transaction Engine
The Command Set
Kns, the Client Utility
The KNS API

What it is

Ken's Name Server is a simple transaction server that maintains a basic database of name-value pairs. The database is maintained on several servers to provide redundancy in case one server dies or is shut down due to maintenance.

The underlying mechanism is a general purpose, fault tolerant transaction engine. This is the same basic engine that is used to support our passwd file synchronization and email server aliasing.

The actual database is a simple a DBM database with the name acting as the key and the value acting as the content. This is backed up by the transaction logs that the transaction engine maintains. The transaction logs can be replayed at any time to rebuild the database.

Access and Security

The name server uses Jim Fox's Lightweight Secure Connection Library to control access to the database. Each client connecting to the KNS dæmon must specify a valid LSC key. LSC keys have a public portion and a private portion. The key names are in the form:

knsX_group_member

which encodes the following information:

X: Security level

The security level is a hexidecimal digit from 0 to F. The higher the level, the more privileged the client.

	0:	Level zero is unprivileged and can only access unprivileged commands.
	2:	Level 2 is required to retrieve values.
	3:	Level 3 is required to change values.
	4:	Level 4 is required to delete existing names.
	5:	Level 5 is required to create new names.
	A:	Level 10 is required to checkpoint the database.
	C:	Level 12 is required to checksum the database.
	F:	The master servers all run at level 15.

group

The group is an arbitrary character string.

member

The member is an arbitrary character string, but for practical purposes should be the host name of the computer owning the key.

The public keys for the servers need to be distributed to each of the servers. There's a script, newkey.sh in the installation materials that will create a new private and public key and update the global file that gets installed everywhere. This script can be used when adding a new host to the list of server hosts.

The Transaction Engine

The KNS servers are capable of forming a many tiered hiearchy of servers with a master at the top level, two or more chiefs at the second level and countless slaves at the third and lower levels. In actuality, though, it is unlikely that KNS would require more than a master and a couple of chiefs.

The clients who want to update the database will connect to the master KNS server. Commands coming into the master that affect the database in some way will be assigned the next transaction number by the master (after it validates that it is a legitimate request). This transaction will then be passed to each of the chiefs and then processed by the master itself and then written to the master's log. Each chief (and slave) in turn will pass the command to its slaves, process the command and log it. After the transaction number is assigned and passed to one or more of the chiefs, the master will bail out if it encounters some sort of error such as a full disk condition. One of the chiefs it sent that command to will now have the highest sequence number and that chief will become the new master. When the failing server is all better and reconnects to the new master as a chief, that master will pass the transaction that failed back to it and it will attempt to reprocess it.

When a server comes up for the first time or after being down for awhile it reads its configuration file to get the list of potential master servers and its log file to determine its last sequence number. It then sends out an "I am here" datagram to all the known servers asking for a suitable server to connect to. If it gets no answer, and it's a potential master itself, it will increment a counter and send its query again. While it's waiting for a response, if it receives an "I am here" query from another server with a higher sequence number it will reset its counter back down. Eventually one server will increment its counter to three and declare itself the master.

When the master server receives an "I am here" query it will reply with an "Attach to me" response unless it already has too many clients in which case it will pass the query down to one of its subordinant chiefs or slaves to respond. When the server connects it will request all transactions since its last transaction. Each transaction specifies the server ordinal that originally made the transaction and who made the previous transaction. If a transaction is out of sequence or has a mismatching previous transaction server the server goes into limbo, not accepting any new clients and not processing any new transactions, until things can be corrected by hand. This ensures that no transactions are missed and that all the connected servers are on the same page.

If the network becomes partitioned, multiple servers may declare themselves master and continue to process new transactions. If the master server does not have all of its chiefs connected, it will periodically send out "Hey, are you there" queries to the missing server(s). When the network reattaches itself, the two master servers will end up sending each other such a message. The one that processed the fewer number of transactions will yield the master status and attempt to connect to the other master. It may decide to put itself into limbo if the transaction history doesn't match up properly. In this case, the operator will be notified and someone will have to come in and do damage control by hand. The likely scenerio is to cut the limboed server's log off at the discrepancy, get it connected and then then reapply the transactions that followed that point. As this situation seldom, if ever, comes up, there is no automated procedure for correcting from this situation.

The Command Set

The set of commands supported by the server are:

Create name value

The Create command creates a name and gives it the specified value. If the name already exists, the Create command will produce an error. The Create command is converted to a Set command by the master server prior to being sent to subordinant servers or being written to the log file. This allows the command to be reprocessed if the database needs to be rebuilt.

Delete name

The Delete command deletes a name-value pair from the database. Deleting a nonexistant name is not an error and simply does nothing.

Get name

The Get command retrieves a value from the database for the given name. If the name is not in the database an error is returned.

List pattern

All names matching the pattern are returned. The pattern is defined using standard ex regular expressions. It is much faster to use Get if you know the full name.

Set name value

Assigns the given value to the specified name, creating it if necessary.

KNS command [parameters]

The above commands are all database specific. The KNS subcommands that follow are all generic commands that the basic transaction engine implements. These commands are used by the servers to communicate with each other.

Attach address port seq: This request comes down from the master, telling the slave server to have the client attach to itself. This request may be passed down to a slave of the slave if the slave already has enough clients.
Balance count: The balance command instructs the server to drop the specified number of slave servers in order to balance the tree. Portions of the balance command will be passed down to slave servers or the whole subtrees will be dropped.
Checkpoint: The checkpoint command instructs the server to dump its database out to a base file and start a new log file. To rebuild the database, a server will replay its base file followed by the current log file. The checkpoint shortens the process by allowing the server to skip all the changes that were superseded by subsequent changes.
Checksum [Forward] [Dump] [Summary]: The checksum command instructs the server to generate a checksum of its database. The Forward option causes the command to be forwarded to all slave servers as well as the current one. The Dump option causes a full dump of the database while the Summary option simply generates a integer checksum. This checksum, written to a file in a common directory, is used to verify the integrity of the database with respect to the other servers' databases. A periodic process will connect as a client and ask for the summary and send an email message if not all servers generate the same value. Manual debugging can then use the Dump option to figure out why the databases dont jive.
Drop client: The Drop command is used to tell the server to drop a specific client. This is used in the balancing process to keep the tree balanced.
Newseq seq: The Newseq command is used to skip to a specific sequence number. This is used to synchronize the base file to the log file. The last command of the base file and the first command of the log file will always be Newseq commands.
Ruthere: Asks the server "Are you there?". If no other activity has occurred recently, a slave server will send this query to its master to verify that the master is still alive. If it gets no answer it will disconnect and attempt to reconnect or become the master itself.
Server seq min_seq: When a server connects into the hierachy, it sends the server command saying that it is a server rather than a standard client. The master will then pass all transactions since the specified sequence number to catch the server up to the rest of the pack.
Status min_seq num_child: When its status changes, a slave will notify its master with a status command. This allows the master to determine which slave to pass clients to in order to maintain a balanced tree.

Kns, the Client Utility

A simple utility exists to pass commands to the KNS servers. The general usage is:

kns -c 'command'

The specified command is sent to the server. If no -c option is specified or a single dash is specified for the command, all lines on stdin are processed as separate commands until there is an error, end of file or an end/exit/quit command is encountered.

The KNS API

Communication with the KNS servers is accomplished through the KNS Application Programming Interface. This library contains the following routines:

extern char *kns_errmsg;
extern int  kns_errno;

void ksn_close(void);
int  kns_cmd(char *msg);
int  kns_conf(char *dir);
void kns_deinit(void);
int  kns_init(char *dir);
int  kns_open(void);
int  kns_rc(int rc);

General Basics

The normal sequence is to call kns_init with a null parameter to things set up. This initializes the various global variables and calls the kns_conf routine to read the configuration file, /usr/local/lib/kns/conf, which defines the host and tcp/udp port and ordinal for the KNS servers.

Once initialized, the kns_cmd routine is called with an ASCII command to be processed by the server. Kns_cmd will call kns_open if necessary to get connected to the current master server. The reply from kns_cmd will indicate success or failure:

KNS_RETRY -2 Didn't work, but may later

KNS_ERROR -1 Unsuccessful return value

KNS_OKAY 0 Successful return value

KNS_MORE 1 More data available

In addition, the kar_errno variable will contain an extended error status as defined by the KNS_E_xxx constants in the kns.h include file and kns_errmsg will point to a suitable message to be displayed as part of an error message as well as any response from the server. Note that these two global variables (and others) indicate that the KNS API is not officially "thread safe". Only one thread, normally the main thread, should be making calls to the KNS library.

The kns_deinit routine can be called followed by a second call to kns_init to force a rereading of the configuration for processes that may continue to run for extended periods during which the configuration file is expected to change. The kns_close routine is called internally by kns_deinit and is not expected to be referenced by client applications directly.

The kns_rc routine is used internally to set the kns_errno and kns_errmsg variables and return the actual reply code. It also throttles requests that result in KNS_RETRY responses down to one every 10 seconds to reduce the overhead of processes spinning their wheels. This routine is not intended to be called directly by the client.

Ken Lowe
Email -- ken@u.washington.edu
Web -- http://staff.washington.edu/krl/

KNS_RETRY	-2	Didn't work, but may later
KNS_ERROR	-1	Unsuccessful return value
KNS_OKAY	0	Successful return value
KNS_MORE	1	More data available