The Coda HOWTO <author> Peter Braam, <tt/braam@cs.cmu.edu/, Robert Baron <tt/rvb@cs.cmu.edu/, Jan Harkes <tt/jaharkes@cs.cmu.edu/, Marc Schnieder <tt/smarc@cs.cmu.edu/. </author> <date>v0.50, Dec 12, 1998 <abstract> This HOWTO should give you help in running Coda. We try to cover enough detail for you to install, build and run Coda, and also explain how to troubleshoot and prepare useful bug reports. </abstract> <toc> <sect>Coda's ingredients <sect1> What is Coda ? Coda is a distributed file system, i.e. it makes files available to a collection of client computers as part of their directory tree, but ultimately maintains the authoritative copy of the file data on servers. Coda has some features that make it stand out: it supports <em/disconnected operation/, i.e. full access to a cached section of the file space during voluntary or involuntary network or server outages. Coda will automatically reintegrate the changes made on disconnected clients when reconnecting. Further Coda has read write, failover server replication, meaning that data is stored and fetch from any of a group of servers and Coda will continue to operate when only a subset of all servers is available. If server differences arise due to <em/network partitions/ Coda will resolve differences automatically to a maximum extent possible and aid users in repairing what can't be done automatically. Coda is very differently organized from NFS and Windows/Samba shares. Coda does have many similarities to AFS and DCE/DFS. <sect1> Getting clued in with the Coda terminology <descrip> <tag/A single name space/ All of Coda appears under a single directory <tt>/coda</tt> on the client (or under a single drive under Windows). Coda does not have different exports or shares as do NFS and Samba that are individually mounted. Under <tt>/coda</tt> the volumes (aka file sets) of files exported by all the servers (living in your Coda cell) are visible. Coda automatically finds servers and all a client needs to know is the name of one bootstrap server that gives it information how to find the root volume of Coda. <tag/A Coda cell/ is a group of servers sharing one set of configuration databases. A cell can consist of a single server or up to hundreds of servers. One server is designated as the <bf>SCM</bf>, the system Control machine. It is distinguished by being the only server modifying the configuration databases shared by all servers, and propagating such changes to other servers. At present a Coda client can belong to a single cell. We hope to get a cell mechanism into Coda whereby a client can see files in multiple cells. <tag/Coda volumes:/ File servers group the files in volumes. A volume is typically much smaller than a partition and much larger than a directory. Volumes have a root and contain a directory tree with files. Each volume is "Coda mounted" somewhere under /coda and forms a subtree of the /coda. Volumes can contain mountpoints of other volumes. A volume mountpoint is not a Unix mountpoint or Windows drive - there is only one drive or Unix mountpoint for Coda. A Coda mountpoint contains enough information for the client to find the server(s) which store the files in the volume. The group of servers serving a volume is called the Volume Storage Group of the volume. <tag/Volume Mountpoints/ One volume is special, it is the root volume, the volume which Coda mounts on <tt>/coda</tt>. Other volumes are grafted into the <tt>/coda</tt> tree using the <bf> cfs makemount</bf>. This command installs a volume mountpoint in the Coda directory tree, and in effect its result is similar to <bf> mkdir mountpoint ; mount device mountpoint </bf> under Unix. When invoking the <bf> cfs makemount</bf> the two arguments given are the name of the mountpoint and the name of the volume to be mounted. Coda mountpoints are persistent objects, unlike Unix mountpoints which needs reinstating after a reboot. <tag/Data storage/ The servers do not store and export volumes as directories in the local disk filesystem, like NFS and Samba. Coda needs much more meta data to support server replication and disconnected operation and it has complex recovery which is hard to do within a local disk filesystem. Coda servers store files identified by a number typically all under a directory <tt>/vicepa</tt>. The meta data (owners, access control lists, version vectors) and directory contents is stored in an RVM data file which would often be a raw disk partition. <tag/RVM/ stands for <em/Recoverable Virtual Memory/. RVM is a transaction based library to make part of a virtual address space of a process persistent on disk and commit changes to this memory atomically to persistent storage. Coda uses RVM to manage its metadata. This data is stored in an RVM data file which is mapped into memory upon startup. Modifications are made in VM and also writtent to the RVM LOG file upon committing a transaction. The LOG file contains committed data that has not yet been incorporated into the data file on disk. <tag/Client data/ is stored somewhat similarly: meta data in RVM (typically in <tt>/usr/coda/DATA</tt>) and cached files are stored by number under <tt>/usr/coda/venus.cache</tt>. The cache on a client is persistent. This cache contains copies of files on the server. The cache allows for quicker access to data for the client and allows for access to files when the client is not connected to the server. <tag/Validation/ When Coda detects that a server is reachable again it will <em/validate/ cached data before using it to make sure the cached data is the latest version of the file. Coda compares cached version stamps associated with each object, with version stamps held by the server. <tag/Authentication/ Coda manages authentication and authorization through a token. Similar (the details are very different) to using a Windows share, Coda requires users to log in. During the log in process, the client acquires a session key, or token in exchange for a correct password. The token is associated with a user identity, at present this Coda identity is the uid of the user performing the log in. <tag/Protection/ To grant permissions the cache manager and servers use the token with its associated identity and match this against priviliges granted to this identity in access control lists (ACL). If a token is not present, anonymous access is assumed, for which permissions are again granted through the access control lists. </descrip> <sect1> Organization of the client <sect2> The kernel module and the cache manager Like every filesystem a computer enabled to use the Coda filesystem needs kernel support to access Coda files. Coda's kernel support is minimal and works in conjunction with the cache manager <bf>Venus</bf>. User requests enter the kernel, which will either reply directly or ask the cache manager <bf>venus</bf> to assist in service. Typically the kernel code is in a kernel module, which is either loaded at boot time or dynamically loaded when Venus is started. Venus will even mount the Coda filesystem on <tt> /coda</tt>. <sect2> Utilities To manipulate acl's, the cache, volume mountpoints and possibly the network behaviour of a Coda client a variety of small utilities is provided. The most important one is the <bf>cfs </bf> command. There is also a <bf>clog</bf> program to authenticate to the Coda authentication server. The <bf/codacon/ programm allows one to monitor the operatoin of the cache manager, and <bf/cmon/ program gives summary information about a list of servers. <sect1> The server organization The main program is the Coda fileserver <bf>codasrv</bf>. It is responsible for doing all file operations, as well as volume location service. The Coda authentication server <bf>auth2</bf> handles requests from <bf>clog</bf> for tokens, and changes of password from <bf>au</bf> and <bf>cpasswd</bf>. Only the the auth2 process on the SCM will modify the password database. All servers in a Coda cell share the configuration databases in <tt>/vice/db</tt> and retrieve them from the SCM when changes have occurred. The <bf>updateclnt</bf> program is responsible for retrieving such changes, and it polls the <bf>updatesrv</bf> on the SCM to see if anything has changed. Sometimes the SCM needs a (non-shared) database from another server to update a shared database. It fetches this through an <bf>updatesrv</bf> process on that server using <bf>updatefetch</bf>. <sect2> Utilities On the server there are utilities for volume creation and management. These utilities consist of shell scripts and the volutil command. There is also a tool to manipulate the protection databases. <sect> Getting Coda All code can be obtained from: <URL URL="ftp://ftp.coda.cs.cmu.edu/pub/coda" NAME="ftp://ftp.coda.cs.cmu.edu/pub/coda">. For any platform you need: <itemize> <item>documentation and manual pages <em/coda-doc/ <item>user level client code for coda: <em/coda-client/ <item>a server: <em/coda-debug-server/ rpms <item> kernel code for the client: <em/coda-kernel-module/ rpms code </itemize> <bf/NOTE:/ Coda is <em/platform independent/. You can mix and match servers and clients on any supported platform. <sect1> Linux <sect2> Getting RPM packages We have RPMS for a RedHat Linux at <URL URL="ftp://ftp.coda.cs.cmu.edu/pub/coda/linux/" NAME="ftp://ftp.coda.cs.cmu.edu/pub/coda/linux/">. You should get: <itemize> <item>documentation and manual pages <em/coda-doc/ rpms <item>a client: <em/coda-debug-client/ rpms <item>a server: <em/coda-debug-server/ rpms <item> kernel code for the client: <em/coda-kernel-module/ rpms code </itemize> Install these as ususal. You may need to build a Coda kernel module for which you will need source. Look at the section <ref id="Modules" name="building a kernel module."> <sect1> FreeBSD <sect2> Obtaining Packages As of this writing, the elf packages have been built but not tested as heavily as the aout packages. You may obtain the newest compiled packages from the <URL URL="ftp://ftp.coda.cs.cmu.edu/pub/coda/freebsd/3.0-elf/i386/" NAME="Coda site"> or other mirrors. <bf/NOTE:/ Below I am using a specific Coda release, 4.6.6.1, you will no doubt find a different release on the ftp site when you access it; so use the newer release version. <itemize> <item> Coda-intro is a quick get started guide to using Coda. <tscreen> <verb> pkg_add coda-intro-4.6.6.1.tgz </verb> </tscreen> <item> The Coda client is needed by a machine to use the Coda filesystem. Aprox Size = 3.6Meg <tscreen> <verb> pkg_add coda-client-4.6.6.1.tgz </verb> </tscreen> <item> The Coda server is needed by a machine to create and serve the filesystem to client machines. Aprox Size = 5.5Meg <tscreen> <verb> pkg_add coda-server-4.6.6.1.tgz </verb> </tscreen> <item> Finally, we have a wealth of documentation about Coda and need to generate still more. Aprox Size = 1.5Meg <tscreen> <verb> pkg_add coda-doc-4.6.5.3.tgz </verb> </tscreen> </itemize> If you substitute 3.0-aout for 3.0-elf in the ftp directory, you will get the aout packages. The aout packages for the client and server are about 1.5Meg larger than the elf packages. <bf/NOTE:/ pkg_add may fail silently. Please make sure you have at least 25MB of free space available in <tt>/var/tmp</tt> to install the each package. <sect1> NetBSD <sect2> Obtaining Pre-compiled Packages You may obtain the newest pre-compiled packages from the Coda site <URL URL="ftp://ftp.coda.cs.cmu.edu/pub/coda/netbsd/current/i386/" NAME="Coda site"> or other mirrors. <bf/NOTE:/ Below I am using a specific Coda release, 4.6.6.1, you will no doubt find a different release on the ftp site when you access it; so use the newer release version. <itemize> <item> Coda-intro is a quick get started guide to using Coda. <tscreen> <verb> pkg_add coda-intro-4.6.6.1.tgz </verb> </tscreen> <item> The Coda client is needed by a machine to use the Coda filesystem. Aprox Size = 5Meg <tscreen> <verb> pkg_add coda-client-4.6.6.1.tgz </verb> </tscreen> <item> The Coda server is needed by a machine to create and serve the filesystem to client machines. Aprox Size = 6.8Meg <tscreen> <verb> pkg_add coda-server-4.6.6.1.tgz </verb> </tscreen> <item> Finally, we have a wealth of documentation about Coda and need to generate still more. Aprox Size = 1.5Meg <tscreen> <verb> pkg_add coda-doc-4.6.5.3.tgz </verb> </tscreen> </itemize> <bf/Note:/ pkg_add may fail silently. Please make sure you have at least 25MB of free space available in /var/tmp to install the each package. <sect1> Windows 95 Get the installer files for servers and clients from <URL URL="ftp://ftp.coda.cs.cmu.edu/pub/coda/win95" NAME="ftp://ftp.coda.cs.cmu.edu/pub/coda/win95">. The Server has not been tested on Windows 95 and might not run at all. Use the Server for Windows NT instead. Install these by double clicking on the installer files. <bf/NOTE:/ Before you start, make sure you know the IP address of your client and of the server you are connecting to. The default server address offered is that of the testserver at CMU. <sect1> Windows NT Do all your work as administrator. First get the Cygwin B19 kit from <URL URL="http://www.cygnus.com/misc/gnu-win32/" NAME="http://www.cygnus.com/misc/gnu-win32/"> and install this on the NT machine, for example in C:\Cygnus Get the installer files for servers and clients from <URL URL="ftp://ftp.coda.cs.cmu.edu/pub/coda/winnt" NAME="ftp://ftp.coda.cs.cmu.edu/pub/coda/winnt">. Install these by double clicking on the installer files. (Note: these replace the original cygwin32.dll with a patched version. A future version of Coda will strive to use the default version). <sect> Configuring and starting Coda <bf/WARNING:/ CODA IS BARELY READY FOR PRODUCTION USE. THIS RELEASE IS JUST FOR THOSE INTERESTED IN EXPLORING THOSE FEATURES WHICH WORK. IT CONTAINS KERNEL CODE, AND SERVERS RUNNING WITH ROOT PRIVILEGES, AND COULD LEAD TO DATA LOSS. To get Coda running you will go through 3 steps: <enum> <item> Get a Coda enabled kernel for the client. (The server does not need a special kernel) <item> Configure and run the client cache manager Venus <item> Configure and run the file server, authentication server and update servers <item> Connect a client to your new server. </enum> <sect1> Getting a Coda enabled kernel Before you can use a Coda client you need a new filesystem driver in your kernel. For the most part this driver redirects requests to the user level cache manager <bf>venus</bf>. Precompiled modules exist for commonly used kernels. If no module exists for you, you'll find instructions below where to find it. <sect2> Linux Get a <URL NAME="coda-fs-module-k2.?.?-c?.?.?.arch.rpm" URL="ftp://ftp.coda.cs.cmu.edu/pub/coda/linux"> rpm package and install it. <tt/k2.?.?/ should match your kernel version (use uname -a) and <tt/c?.?.?/ should match you Coda version. Linux kernels change often and many people have custom kernels for their environment. Our modules will generally only work on RedHat Linux kernels and you may have to build a module for your kernel. Look at the section <ref id="Modules" name="building a kernel module."> <sect2> FreeBSD You may obtain a Coda lkm from your FreeBSD distribution: <tscreen><verb> cd /lkm </verb></tscreen> Check if <tt/coda_mod.o/ is there already. Otherwise, you may obtain a Coda lkm from the Coda site: <tscreen> <verb> ftp://ftp.coda.cs.cmu.edu/pub/coda/freebsd/3.0-aout/i386/coda_mod.o </verb> </tscreen> You then install the lkm with: <tscreen> <verb> modload -v -e coda_mod -o /var/run/lkm.coda /lkm/coda_mod.o </verb> </tscreen> You can build support for the Coda VFS layer into your kernel. This is discussed later in the <em/Building FreeBSD/ section. <sect2> NetBSD <bf/NOTE:/ The GENERIC NetBSD kernel should have Coda enabled. Do an: <tscreen> <verb> nm -o /netbsd | grep coda_open </verb> </tscreen> If this is present, you are done with this section. <bf/PROCEED NO FURTHER/ If you need to load an lkm it should be in the NetBSD distribution: <tscreen> <verb> cd /usr/lkm </verb> </tscreen> Check if coda.o is there already. Otherwise, you may obtain a Coda lkm from the Coda site: <tscreen> <verb> ftp://ftp.coda.cs.cmu.edu/pub/coda/netbsd/current/i386/coda-1_3H.o </verb> </tscreen> You then install the lkm with: <tscreen> <verb> modload -v -e coda_lkmentry -o /var/run/lkm.coda /usr/lkm/coda-1_3H.o </verb> </tscreen> You can build support for the Coda VFS layer into your kernel, though this should be automatic. This is discussed later in the <em/Building NetBSD/ section. <sect2> Windows 95 & NT The kernel module is part of the Coda client installer. <sect1>Linux and the BSD's: running Venus, the client cache manager These are partial instructions on how to setup and configure the Coda filesystem. Refinements to the setup created here are discussed in <URL URL="http://ftp.coda.cs.cmu.edu/doc/html/manual.html" NAME="http://ftp.coda.cs.cmu.edu/doc/html/manual.html"> You will probably not need these refinements in the first instance. The <tt/venus-setup/ script does all the hard work, it will setup the coda control files, create <tt>/dev/cfs0</tt> to communicate with the kernel, ... It also initializes a directory for cache files. In your first Coda run we recommend a small cache, say 20MB. The cache size should be at least 10Meg, typically 60-200Meg is used. Do not go above 300Meg. All the files created will be placed under <tt>/usr/coda</tt>. You should make sure that there is enough space in the file system on which <tt>/usr/coda</tt> resides to hold a fully populated cache. <tscreen> <verb> venus-setup <comma_separated_host_list> <cache_size_in_kb> </verb> </tscreen> <tt/venus-setup/ and <tt/venus/ (below) are in <tt>/usr/sbin</tt>. Make sure that <tt>/usr/sbin</tt> is in your path or that you use fully qualified pathnames. We strongly recommend that you try testserver.coda.cs.cmu.edu as the <tt/comma_separated_host_list/ first, and keep the cache size to 20000. <bf/NOTE:/ <tt/venus-setup/ will edit <tt>/etc/services</tt> to add some additional services. The following assumes you are running X-Windows. However, you could run these commands from virtual consoles as well, by omitting the <tt/xterm -e/ in front of the commands below. Start Venus with: <tscreen> <verb> venus & </verb> </tscreen> An <tt/-init/ flag can be given when <tt/venus/ is started; it flushes the local cache contents. <tt/venus-setup/ forces an init to happen when venus is first started. The <tt/-init/ flag can be given if Coda cannot recover it's cache after a crash, or after editing the <tt/vstab/ file manually. Observe the venus log with: <tscreen> <verb> xterm -e tail -f /usr/coda/etc/console </verb> </tscreen> It will tell you when venus has started and give status. Type: <tscreen> <verb> xterm -e codacon & </verb> </tscreen> to see the communications between the Venus and Vice. It is possible to see the upcalls from the kernel to Venus by turning up logging in Venus, but they are not very interesting. (To turn on minimal debugging, type: <tscreen> <verb> vutil -d 1 </verb> </tscreen> and then <tt>tail -f /usr/coda/coda.cache/venus.log</tt>.) To halt venus, type: <tscreen> <verb> vutil shutdown umount /coda (Linux only) </verb> </tscreen> Or you can kill -9 venus, if you must. <bf/NOTES for Linux users:/ <itemize> <item> Before restarting Venus <tt>/coda</tt> must be unmounted. If umounting <tt>/coda</tt> gives trouble, make sure to exit all process that hang on to Coda, e.g. by having files open or being cd'd into /coda. A utility like <tt/lsof/ and <tt/fuser/ can help with this. <item> <tt>/proc/fs/coda</tt> has interesting Coda statistics. <item> You can enable kernel debugging with <tt>vutil -kdebug 4095</tt> and call tracing with <tt>vutil ktrace 1</tt>. The messages appear in <tt>/var/log/messages</tt> </itemize> <sect1>Windows 95: Starting and Configuring a Coda client<label id="sec-win95-start-conf"> During installation you will be prompted for the IP address of your client and of your Coda server(s). Enter this as indicated. All executables can be found in the directory <tt>C:\usr\coda\bin.</tt> <enum> <item> Type <tscreen> <verb> relay.exe </verb> </tscreen> in a command Window. This is a 32bit DOS application which you can also start by double clicking on the file in explorer. <item> If this is the first time you start Coda run <tscreen> <verb> venus.exe -child </verb> </tscreen> in a command window. <item> Run <tscreen> <verb> venus.exe -cf 500 -init </verb> </tscreen> (this gives up to 500 cache files, try with more if everythings works fine.) <item> When Venus prints "Venus starting ..." run <tscreen> <verb> mount.exe N: </verb> </tscreen> This maps the Coda filesystem to the <tt>N:</tt> drive. </enum> You are now ready to browse through the Coda filesystem using the explorer! <enum> <item> When you want to restart Venus without initializing the cache just run <tt>venus.exe</tt>. If <tt/venus/ crashes restarting it is normally all you need to do. <item> If relay crashes you will want to unmount the <tt>N:</tt> drive run: <tscreen> <verb> unmount.exe </verb> </tscreen> This will stop <tt>relay.exe</tt> (if it is still running) and unload <tt>codadev.vxd</tt> which was loaded by <tt>relay.exe</tt>. </enum> <sect1> Codasrv -- the file server <label id="codasrv"> <sect2>Configuring your server To set up an SCM server, you will run a script <em/vice-setup/. This script creates configuration files under the <tt> /vice </tt> directory and creates files and directories on your system for storage of file data and metatdata. To answer the questions <em/vice-setup/ is asking you need to have the following thought through: <descrip> <tag/file space/an empty directory (viz /vicepa) where the fileserver will put files. There must be as much free space on this filesystem as the data you wish to store in Coda. <tag/RVM metadata storage/ a file or raw partition for RVM metadata. You can use a file but it will be quite slow on a larger server. This partition must be around 4% of the total size of the files you wish to store under /vicepa (e.g. on a 2GB server we use around 80M of rvm data). For first installations we recommend the default 22M options, and using <em/files/ for RVM log and data. (NOTE: Windows NT Setup creates the file c:\coda\rvm\DATA. Use this for RVM metadata.) <tag/virtual memory/ The metadata, held in the RVM data file, is memory mapped. You need that amount of space as virtual memory on your system, in addition to virtual memory to run the server (~6MB) and other software. <tag/RVM transaction log/a LOG file, preferably a raw partition on a disk by itself. This needs not be large, a few M's are fine. (NOTE: Windows NT Setup creates the file c:\coda\rvm\LOG. Use this for RVM transaction log.) <tag/A server number/ All servers in a coda cell need to have a unique number to identify them. The servername to identifier mappings have to be defined by the administrator in the file <tt>/vice/db/servers</tt> on the SCM. The format of this file is as follows: <tscreen> <verb> servernameX.domain.name 1 servernameY.domain.name 2 ... </verb> </tscreen> There are currently several limitations to which identifiers are actually usable: <itemize> <item> all numbers must fit in a single byte. <item> 0 and -1 (255) are used in error conditions. <item> 127 is used to identify `replicated volumes'. </itemize> This leaves us with a usable range of 1-126 and 128-245 for server identifiers. <tag/secret tokens/two secret tokens of _exactly_ 8 characters (eg elephant). </descrip> <bf/NOTE:/ you are now ready to run the setup script. We <bf/strongly/ recommend that you stick to default choices offered as configuring a server differently is quite difficult. Then run: <tscreen> <verb> vice-setup </verb> </tscreen> and answer its questions. Note down the commands that vice-setup prints out for you at the end. <bf>Note: </bf> For Windows NT you will need the Cygwin B19 Shell which can be started from the "Start" menu. The shell uses the c:\coda directory as the root mountpoint "/". <sect2> Running Vice: Start the rpc2portmap server, update server, client and the auth server, as well as the fileserver by typing: <descrip> <tag/Linux/ <tscreen> <verb> /etc/rc.d/init.d/auth2.init start /etc/rc.d/init.d/update.init start /etc/rc.d/init.d/codasrv.init start </verb> </tscreen> <tag/BSDs/ <tscreen> <verb> /etc/rc.vice start </verb> </tscreen> <tag/Windows NT/ <tscreen> <verb> codastart </verb> </tscreen> </descrip> Now observe the log: <tscreen> <verb> xterm -e tail -f /vice/srv/SrvLog & </verb> </tscreen> The <tt/SrvLog/ should show <em/File Server started/. If not, you have a problem. Determine with <tt/ps/ that <tt/codasrv/, <tt/auth2/, <tt/rpc2portmap/, <tt/updatesrv/ and <tt/updateclnt/ are running. <sect2>Making your root volume During your configuration session, you commmunicated a name for the root volume to the program. This root volume now needs to be craeted: the precise command to do this was printed out by the vice-setup program, below we assume your file space is in <tt>/vicepa</tt>, and your root volume is <tt/coda:root/. <tscreen> <verb> createvol_rep coda:root E0000100 /vicepa </verb> </tscreen> <bf/NOTE:/ E0000100 is the Volume Storage Group set up for you by vice-setup. With more servers you can define other groups in /vice/db/VSGDB -- see the Coda User Manual. <sect1>Connecting your client to a new Coda server Now you are ready to point a Venus (client) at this server. You do this by typing <tscreen> <verb> venus-setup server-name cache-szie-in-kb </verb> </tscreen> <bf/NOTE:/ Windows 95 users should type the IP address and not the hostname of the server. Start Venus as explained above. From the coda client side, the root volume will appear under /coda. To use it you must now authenticate to Coda, since it is write protected. The vice-setup program installed an administrative Coda user on the server. It has a uid you chose and has been assigned password changeme. You may clog into Coda with this uid: <tscreen> <verb> clog "adminuser" </verb> </tscreen> Validate that you have tokens with the <tt/ctokens/ utility. You can now create files in <tt/coda/, because the administrative user is on the access control list (ACL) of the <tt>/coda</tt> directory. Read the next section to find out how to do more with Coda. <sect>Exploring Coda's features <sect1> Restarting a server On the server the command <tt/volutil shutdown/ will stop the server. If it is really toasted, you might need to kill the <em/codasrv/ process. To restart the server issue the command <tt/startserver/. This shell script invokes <tt/codasrv/ with the correct arguments. If it complains about a server already running do the following: first check with <tt>ps auxww | grep codasrv</tt> if this is the case. If not, remove the file <tt>/vice/srv/pid</tt> and reissue the <tt/startserver/ command. <sect1> Getting more volumes It is a good idea to create a few extra volumes on your server and mount these in the coda directory tree. In particular if you want to explore reintegration, conflict resolution and replication servers we recommend that you do that in new volumes and not in the volume mounted on the root of the Coda filesystem. To make a new volume: <enum> <item> make sure your codasrv, updatecln and updatesrv are running. <item> type <tt/createvol_rep volname vsgaddr partitionname/. Typically you determine your own volume name, the VSG address can be found in /vice/db/VSGDB and the partition name will be /vicepa. <item> mount the volume using <tt/cfs makemount path volumename/. This creates the mount point, and mounts the volume. </enum> To <bf/explore volumes/ you can use other cfs commands, such as <tt/cfs whereis path/ and <tt/cfs listvolume path/. To see the FID of a file type <tt/cfs getfid path/. <sect1> Adding a user This is currently slightly involved. Follow the following steps: <enum> <item> add the user to /vice/db/user.coda <item> add the user to groups in /vice/db/group.coda (do not separate users with commas) <item> run <tt> cd /vice/db; pwd2pdb -u user.coda -g group.coda > vice.pdb </tt> <item> run <tt/pcfgen vice.pdb/. <item> Give the new user a password by using the <tt/au nu/ program (from either a client or a server). </enum> The user should now be able to login to Coda just like the first user set up at installation time. <sect1> Exploring ACL's. You can set and list Access Control Lists on directories using <tt/cfs setacl dir user rights [user rights...]/. To show the ACLs type <tt/cfs listacl directory/. <sect1> Monitoring Coda Use <tt/codacon/ to see many RPC's and a few other actions taken by the client. The file <tt>/usr/coda/etc/console</tt> also has interesting information. To see how a server is doing use <tt/cmon/, in the form <tt/cmon server:25/, 25 depicts the width of the column used by the server. <sect1> Using Coda disconnected and reintegrating. First get a new volume to try this with. We <bf/strongly/ recommend not using the root volume to explore disconnected operation. See below how to add a volume. <enum> <item> change directory into this volume. Do "ls -l" on the directory where you want to start work while disconnected. Get tokens using <tt/clog/. <item> type <tt/cfs disconnect/ or disconnect your network. After 30 seconds you can see that <em/codacon/ (which you should always run) displays that your server is not reachable anymore. <item> do some work disconnected, create some files, edit them etc. <item> type <tt/cfs reconnect/. Venus will discover that the net is up, but you can speed that up by typing <tt/cfs checkservers/. Monitoring codacon, you will see that your changes are being re-integrated. </enum> <sect1> Repairing conflicts Files and directories can get into conflict due to disconnections of clients or servers from the net, as well as through overlapping open/write/close sequences on two clients. A object that is in conflict is representd as a dangling symbolic link and <tt/X->@vol.vnode.unique/ is the file the symlinks points to. How do we get rid of this conflict? <enum> <item> type <tt/cfs beginrepair X/. This changes X from a symlink into a directory. By doing <tt/ls X/ you will see either <tt/local global/ or <tt/server1 server2 server3/. In the first case we have a local global conflict and in the second case a server server conflict. <item> If the objects in the directory <tt/X/ are files, you have a file conflict, they can also be directories, in which case you can find the content underneath. <item> If you are nervous, this is a good moment to make copy of your files. They can be found under the directory X while the repair session is in progress. <item> <tt/cfs endrepair X/ closes up the repair session. <item> All local global conflicts are repaired with <tt/repair/. Type <tt/repair/ and follow its cryptic instructions. <item> Server-server conflicts on files are fixed with either <tt/filerepair/ or with <tt/removeinc/. Server-server conflicts on directories are fixed with <tt/repair/. </enum> <sect1> Exploring replication First you will have to add a second server to your Coda cluster. Install the software and use <tt/vice-setup/ again. This time your server is <em/not/ going to be the SCM. Proceed answering the questions until done. On the SCM add the following: <enum> <item> Your server needs a server number, to be added to the <tt>/vice/db/servers</tt> file <bf/ON THE SCM/. <item> Make two new entries in the <tt>/vice/db/VSGDB</tt> file. One for your new server by itself, one of the form: <tt/E0000104 scm-server second-server/. <item> Start updatesrv and updateclnt on the second server. <item> Start codasrv on the second server <item> Make a new volume <bf/from the SCM/ using <tt/createvol_rep/ giving the address of the volume as <tt/E0000104/. <item> Mount the volume as above. </enum> You can now use this volume and your files will automatically be stored on multiple servers. To temporarily disable a server, and see that things continue to function normally, either shut the server down with <tt/volutil shutdown/ or disconnect its network. You can also isolate the server using our network filters with <tt/filcon isolate -s server-name/. Using <tt/filcon clear server-name/ clears the filter. Modifications made to coda files during the server outage will be resolved when the files are first accessed. You see message of the form <tt/Resolve path/ in the codacon output. By typing <tt/cfs checkservers/ you can see if the server is available again. <sect>Troubleshooting The Coda filesystem is still under development, and there certainly are several bugs which can crash both clients and servers. However, many problems users observe are related to semantical differences of the Coda filesystem compared to well-known NFS or SMB network filesystems. This section will point out several logs to look at for identifying the cause of problems. Even if the source of the problem cannot be found, the information gathered from Coda's logging mechanisms will make it easier for people on the coda mailinglist <htmlurl url="mailto:coda-discuss@coda.cs.cmu.edu" name="<coda-discuss@coda.cs.cmu.edu>"> to assist in solving the problem(s). Some of the more common problems are illustrated in detail. At the end of this section some of the more involved debugging techniques will be addressed. This will be helpful to developers to isolate problems more easily. At the end there is a whole section describing how to solve some problems with Windows95, <bf/only the Coda related stuff!/. <sect1>Basic Troubleshooting Most problems can be solved, or at least recognized by using the information logged by the clients and servers. The first step in finding out where the problems stems from is doing a <bf/tail -f/ on the logfiles. It must also be noted that, when coda clients and servers crash they do not `dump core', but start sleeping so that we developers can attach debuggers. As a result, a crashed client or server still shows up in the <bf/ps auxwww/ output, and only the combination of lack of file-service and error messages in logfiles indicate that something is really wrong. <sect2>Client logging <itemize> <item> <bf/codacon/ is a program which connects to venus and provides the user with run-time information. It is the initial source of information, but cannot be used to look back into the history. It is therefore advisable to always have a codacon running in a dedicated xterm. <tscreen> client$ xterm -e codacon </tscreen> <item> <tt>/usr/coda/etc/console</tt> is a logfile which contains mostly error or warning messages, and is a place to look for errors which might have occured. When assertions in the code fail, it is logged here. <item> <tt>/usr/coda/venus.cache/venus.log</tt> contains more in-depth information about the running system, which can be helpful to find out what the client is or was doing. </itemize> </sect2> <sect2>Server logs <itemize> <item> <bf/cmon/ is an ncurses program that can be run on a client to gather and display statistics from a group of servers. When a server goes down it will not respond to the statistics requests, which makes this a simple method for monitoring server availability. <tscreen> client$ xterm -e cmon server1:100 server2:100 server3:100 ... </tscreen> <item> <tt>/vice/srv/SrvLog and /vice/srv/SrvErr</tt> are the server logfiles. <item> <tt>/vice/auth2/AuthLog</tt> <item> <tt>/vice/srv/portmaplog</tt> <item> <tt>/vice/srv/UpdateClntLog</tt> <item> <tt>/vice/srv/UpdateLog</tt> </itemize> </sect2> </sect1> <sect1>Client Problems <descrip> <tag>Client does not connect to <tt/testserver.coda.cs.cmu.edu/.</tag> When you have set up your client for the first time, and it can not connect to the testserver at CMU, there are a couple of possible reasons. You might be running an old release of coda, check the coda web-site to see what the latest release is. Another common reason is that your site is behind a firewall, which blocks, or allows only outgoing, udp traffic. Either try coda on a machine outside of the firewall, or set up your own server. The third reason is that the testserver might be down, for maintenance or upgrades. That does not happen often, but you can check whether it is up, and how long it has been running using cmon <verb> cmon testserver.cs.cmu.edu:100 </verb> <tag>Trying to access a file returns <em/Connection timed out/ (ETIMEDOUT).</tag> The main reason for getting <em/Connection timed out/ errors is that the volume where the file is located is disconnected from the servers. However, it can also occur in some cases when the client is in write-disconnected mode, and there is an attempt to read a file which is open for writing. See <ref id="Disconnections" name="Volume is disconnected/Volume is write-disconnected"> for more information. <tag/Commands do not return, except by using ^C./ When command are hanging it is likely that venus has crashed. Check <tt>/usr/coda/etc/console</tt> and <tt>/usr/coda/venus.cache/venus.log</tt>. <tag/Venus fails when restarted./ If venus complains (in <tt>venus.log</tt> about not being able to open <tt>/dev/cfs0</tt>, it is because <tt>/coda</tt> is still mounted. <verb> # umount /coda </verb> Another reason for not restarting is that another copy of venus is still around, and venus is unable to open it's network socket. In this case there will be a message in <tt/venus.log/ stating that RPC2_CommInit has failed. <tag/Venus doesn't start./ A reason is that you do not have the correct kernel module. This can be tested by inserting the module by hand, and then listing the available modules. `coda' should show up in that listing. Otherwise reinstall (or recompile) a new module. <verb> # depmod -a # insmod coda.o # lsmod Module Size Used by coda 50488 2 </verb> If the kernel-module can be loaded without errors, check <tt/venus.log/. A message stating `Cannot get rootvolume name' indicated either a misconfigured server or the codasrv/codasrv-se ports are not defined in <tt>/etc/services</tt>, which should contain the following entries. <verb> rpc2portmap 369/tcp rpc2portmap 369/udp codaauth2 370/tcp codaauth2 370/udp venus 2430/tcp venus 2430/udp venus-se 2431/tcp venus-se 2431/udp codasrv 2432/tcp codasrv 2432/udp codasrv-se 2433/tcp codasrv-se 2433/udp </verb> </descrip> </sect1> <sect1>Server Problems <descrip> <tag>server doesn't start due to salvaging problems If this happens you have several options. If the server has crashed during salvaging it will not come up by trying again, you must either repair the damaged volume or not attach that volume. Not attaching the volume is done as follows. Find the volume id of the damaged volume in the SrvLog. Create a file named <tt> /vice/vol/skipsalvage</tt> with the lines: <verb> 1 0xdd000123 </verb> Here <tt/1/ indicates that a single volume is to be skipped and <tt/0xdd000123/ is the volume id of the replica that should not be attached. If this volume is a replicated volume, take all replicas offline, since otherwise the clients will get very confused. You can also try to repair the volume with <tt/norton/. Norton is invoked as: <verb> norton LOG DATA DATA-SIZE </verb> These parameters can be found in /vice/srv.conf. The Norton manual pages give details about norton's operation and there is online guidance available which is possibly more helpful. <bf/NOTES:/ <enum> <item> Often corruption is replicated. This means that if you find a server has crashed and does not want to salvage a volume, your other replicas may suffer the same fate: the risk is that you may have to go back to tape (you do make tapes, right?). Therefore <bf/first/ copy out good data from the available replicas, <bf/then/ attend to repairing or skipping them in salvage. <item> Very often you have to take both a volume and its most recent clone (generated during backup) offline, since corruption in a volume is inherited by the clone. <item> If you find that a replica of a volume is corrupt, <em/do not attempt to merely replace that replica/. We have found that this corrupts the volume databases. It is better to make a new replicated volume and copy of the data from the healthy replicas (keep the server with the bad replica down). </enum> <tag> How to restore a backup from tape Tuesday I lost my email folder - the whole volume <em/moose:braam.life/ was corrupted on server <em/moose/, it wouldn't salvage. Here is how I got it back. First I tried mounting <tt/moose.braam.life.0.backup/ but this was corrupted too. On the SCM in <tt>/vice/vol/VRList</tt> I found the replicated volume number <tt/f0000427/ and the volume number <tt/ce000011/ (ficitious) for the volume. I logged in as root to bison, our backup controlller. I read the backuplog for Tuesday morning in /vice/backuplogs/backuplog.DATE and saw that the incremental dump for August 31st had been fine. At the end of that log, I saw the name <tt/f0000427.ce000011/ listed as dumped under /backup (a mere symlink) and /backup2 as spool directory with the actual file. The backup log almost shows how to move the tape to the correct place and invoke restore: <verb> cd /backup2 mt -f /dev/nst0 rewind restore -b 500 -f /dev/nst0 -s 3 -i </verb> The <tt/-s 3/ option varies according to which /backup[123] volume the backup is restored from. This invokes the restore command. Typing help allowed me to add then extract the file I wanted. It took a little while before the file was back. From the restore prompt do: <verb> restore> cd 31Aug1998 restore> add viotti.coda.cs.cmu.edu-f0000427.ce000011 restore> extract Specify volume #: 1 <verb> In /vice/db/dumplist I saw that the last full backup had been on Friday Aug28. I went to the machine room and inserted that tape (recent tapes are above bison). This time f0000427.ce000011 was a 200MB file (the last full dump) in /backup3. I extract the file as above. Then I merged the two dumps: <verb> merge /restore/peter.mail /backup2/28Aug1998/f0000427.ce000011 \ /backup3/31Aug1998/f0000427.ce000011 </verb> This took a minute or two to create /restore/peter.mail. Now all that was needed was to upload that to a volume: <verb> volutil -h moose restore /restore/peter.mail /vicepa vio:braam.mail.restored </verb> Back to the SCM, to update the volume databases: <verb> bldvldb.sh viotti </verb> Now I could mount the restored volume: <verb> cfs mkm restored-mail vio:braam.mail.restored </verb> and copy it into a read write volume using cpio or tar. <tag>createvol_rep reports RPC2_NOBINDING.</tag> When trying to create volumes, and createvol_rep reports RPC2_NOBINDING, it is an indication that the server is not (yet) accepting connections. It is useful to look at <tt>/vice/srv/SrvLog</tt>, the server performs the equivalent of <tt>fsck</tt> on startup, which might take some time. Only when the server logs `Fileserver Started' in SrvLog, it starts accepting incoming connections. Another reason is that an old server is still around, blocking the new server from accessing the network ports. <tag>RPC2_DUPLICATESERVER in the rpc2portmap/auth2 logs</tag> Some process has the UDP port open which rpc2portmap or auth2 is trying to obtain. In most cases this is an already running copy of rpc2portmap or auth2. Kill all running copies of the program in question and restart them. <tag>Server crashed shortly after updating files in <tt>/vice/db</tt>.</tag> Servers can crash when they are given inconsistent or bad data-files. You should check whether <tt/updateclnt/ and <tt/updatesrv/ are both running on the SCM and the machine that has crashed. You can kill and restart them. Then restart <tt/codasrv/ and it should come up. <tag/Users cannot authenticate or created volumes are not mountable./ Check whether auth2, updateclnt, and updatesrv are running on all fileservers. Also check their logfiles for possible errors. </descrip> </sect1> <sect1>Disconnections. <label id="Disconnections"> As most common problems are related to the semantical differences arising as a result of `involuntary' disconnections, this section contains some background information of why volumes become disconnected or write-disconnected. And how to get them to reconnect again. <bf/Volume is fully disconnected./ There are several reasons why a coda client may have disconnected some or all volumes from an accessible server. <itemize> <item> Pending reintegration. When modifications have been made to the volume in disconnected mode, the client will not reconnected the volume until all changes have been reintegrated. Also, reintegration will not occur without proper user authentication tokens. Furthermore, reintegration is suspended as long as there are objects in conflict. The most important item here is to have a <bf/codacon/ process running, since it will give up-to-date information on what venus is doing. Venus will inform the user about missing coda authentication tokens, <em/`Reintegration: pending tokens for user <uid>'/. In this case the user should authenticate himself using the <bf/clog/ command. Conflicts, which require us to use the <bf/repair/ tool, are conveyed using the <em/`local object <pathname> inconsistent'/ message. Otherwise codacon should show messages about <em/backfetches/, and how many modifications were successfully reintegrated. <item> Access permissions. The client may also disconnect when a servers reports an error to an operation, when according to the client this is a valid operation. Causes for this are authentication failure; check tokens using <bf/ctokens/ and optionally obtain new tokens using <bf/clog/. Or inconsistencies between the data cached on the client and the actual data stored on the server; this will reveal itself as an inconsistent object during subsequent reintegration. <item> Lost connections. Sometimes the client does not receive a prompt reply from an accessible server, and marks the server as dead. This will ofcourse disconnect the volume if the last server is lost. Once every five minutes, the client automatically verifies connectivity with all known servers, and can thus recover from lost connections. However, this action can also be triggered by the user by excecuting the <bf/cfs checkservers/ command. If <bf/cfs checkservers/ reports that servers are unreachable, it might be interesting to check with <bf/cmon/ if the server is responding at all, since we might be faced with a crashed server. When a server was considered unreachable, but is successfully contacted after `cfs checkservers', reintegration will automatically start (when a user has tokens, and there are no inconsistencies). </itemize> <bf/Volume is write-disconnected./ <em/Write-disconnected operation/ is used as often as <em/weakly connected mode/ to describe this volume state, and they are effectively the same. This is the special situation where a client observes a weak connectivity with a server, and therefore forces the associated volumes in weakly connected mode. Weakly connected volumes postpone writing to the server to significantly reduce waiting on a slow network connection. Read operations are still serviced by the local cache and the servers, as in fully connected mode. Which is why this mode of operation is also called write-disconnected operation. The write operations are effectively a continuous reintegration (<em/trickle-reintegration/) in the background. This mode, therefore, requires users to be authenticated and gives more chance for possible file conflicts. The following points are several reasons for write-disconnected operation. <itemize> <item> Weak network connectivity. Venus uses bandwidth estimates made by the rpc2 communication layer to decide on the quality of the network connection with the servers. As soon as the connectivity to one of the servers drops to below the weakly connected treshhold (currently 50 KB/s), it will force all volumes associated with that server into weakly-connected mode. The <bf/cfs wr/ command can be used to force the volumes back into fully connected mode, and immediately reintegrate all changes. When the user was not authenticated, or conflicts were created during the write-disconnected operation, the user must first obtain proper authentication tokens or repair any inconsistent objects before the volume becomes fully connected again. Here again <bf/codacon/ is an invaluable tool for obtaining insight into the client's behaviour. <item> User requested write-disconnect mode. Users can ask venus to force volumes in write-disconnected mode, exchanging high consistency for significantly improved performance. By using the <tt/-age/ and <tt/-time/ flags on the <bf/cfs wd/ commandline, some control is given about the speed at which venus performs the trickle-reintegration. For instance, to perform the trickle-reintegrate more quickly than the default, where only mutations to the filesystem older than 15 minutes are reintegrated. You could use <bf/cfs wd -age 5/, which will reintegrate all mutations older than 5 seconds. <item> Pending reintegration. When a volume is write-disconnected, it will stay write-disconnected until a user properly authenticates using <bf/clog/. </itemize> </sect1> <sect1>Advanced Troubleshooting <sect2>with rpc2tcpdump <tt/rpc2tcpdump/ is the regular tcpdump, which is modified to decode rpc2 protocol headers. This makes it a very useful tool for analyzing why programs fail to work. All traffic between <tt/venus/ and the coda servers can be viewed using the following command. <verb> # tcpdump -s120 -Trpc2 port venus or port venus-se </verb> To identify problems with <tt/clog/, for instance which server it is trying to get tokens from. <verb> # tcpdump -s120 -Trpc2 port codaauth </verb> <sect2> debugging with gdb To be able to debug programs that use RVM, most coda related application will go into an endless sleep when something goes really wrong. They print their process-id in the log (f.i. <tt/venus.log/ or <tt/SrvLog/), and a user can attach a debugger to the crashed, but still running, program. <verb> # gdb /usr/sbin/venus `pidof venus` </verb> This makes it possible to get a stack backtrace (<tt/where/), go to a specific stack frame (<tt/frame <x>/), or view the contents of variables, (<tt/print <varname>/). By installing the coda sources in same place as where the binaries were initially built from, it is possible to view the surrounding code fragment from within the debugger using the <tt/list/ command. When using RedHat Linux rpms, you can install the sources in the right place by installing the coda source rpm file. <verb> # rpm -i coda-x.x.x.src.rpm # rpm -bp /usr/src/redhat/SPECS/coda.spec </verb> On other platforms look at the paths reported in the backtrace and unpack the source tarball in the correct place. <verb> (gdb) where #0 CommInit () at /usr/local/src/coda-4.6.5/coda-src/venus/comm.cc:175 #1 0x80fa8c3 in main (argc=1, argv=0xbffffda4) at /usr/local/src/coda-4.6.5/coda-src/venus/venus.cc:168 (gdb) quit # cd /usr/local/src # tar -xvzf coda-4.6.5.tgz </verb> </sect2> </sect1> <sect1>Troubleshooting on Windows 95 <sect2>Common problems <descrip> <tag/Unable to mount Coda./ It is only possible to mount coda when <tt>relay.exe</tt> and <tt>venus.exe </tt> are running. Do not type <tt>mount.exe n:</tt> before venus printed the message <tscreen> <verb>venus starting...</verb></tscreen> <tag/Unable to shutdown Windows95./ This occurs when the device <tt>codadev.vxd</tt> couldn't be unloaded on the request issued by typing <tt>unmount.exe</tt>. <tt>relay.exe</tt> printed the message <tscreen><verb> UNLOAD RESULT 0 UNLOAD RESULT 8 </verb></tscreen> when the request was successful. There is no way out of this. Just reboot your machine. <tag>I cannot reboot Windows95 and I think it is due to the VXDs loaded for Coda.</tag> Boot your System in DOS mode by pressing F8 on boot time. Cd to the windows directory and type <tt>edit system.ini</tt>. In the section <tt>[enh386]</tt> you will find the entries <tscreen> <verb> device=c:\usr\coda\bin\mmap.vxd device=c:\usr\coda\bin\mcstub.vxd </verb> </tscreen> Comment them out by using a <tt>;</tt> in front of the lines. Try to restart Windows again. <tag/How can I find out why <tt>relay.exe</tt> crashed./ <tt>relay.exe</tt> is a very tiny program which hands requests from the file system driver up to <tt>venus.exe</tt> and the other way around. Crashes in <tt>relay.exe</tt> are most likely due to buffer overflows or null pointer referencing. <tt>relay.exe</tt> prints debug information in a file called <tt>relay.log</tt> which can be found in the directory where relay was started from. When <tt>relay.exe</tt> crashed, the file system driver is still loaded. Just restart <tt>relay.exe</tt>. It might pretty well not be possible to unmount the system by calling <tt>unmount.exe</tt>. <tag>How can I find out why <tt>venus.exe</tt> crashed.</tag> See troubleshooting venus. When this happens it is possible to restart venus. The file system doesn't need to be unmounted before. <tag/How can I find out more about what has happend/ Look in the file <tt>c:\vxd.log</tt>. The file system driver <tt>codadev.vxd</tt> prints information about all requests and answers in this file. Check the requestnumbers to match with those in <tt>relay.log</tt>. </descrip> </sect2> <sect2>Restrictions <itemize> <item> <tt>hoard</tt>.exe does not work so far. <item> Handling large files (in particular executables) does not work well in a low bandwidth scenario. <item> <tt>cfs.exe</tt> uses absolute pathnames so far. <item> Long filenames are not supported under DOS environment yet. You can access files, but you need to use the long filenames. </itemize> </sect2> </sect1> </sect> <sect> Building Coda As a file system, Coda has several major components that need to be built: <itemize> <item> Kernel code on the client <item> The cache manager Venus on the client <item> The fileserver <item> Utilities for client and server administration. </itemize> <sect1> Building on Linux <sect2> Building the kernel module <label id="Modules"> We now have a reaonably flexible method to build kernel modules. You can build the module for a kernel which you are not running at the time of the build. First of all get the linux-coda-?.?.?.tgz archive from <tt>ftp://ftp.coda.cs.cmu.edu/pub/coda/src/</tt>. You can unpack this anywhere on your target system. <bf/Prepare the kernel tree/ You do need a kernel tree handy, to give the module header information. To get ready: <tscreen> <verb> make oldconfig make dep </verb> </tscreen> In the top directory of linux-coda build the coda.o module: <tscreen> <verb> make config --- answer the questions make coda.o su make install depmod -a </verb> </tscreen> <bf/Notes/ <itemize> <item> If you build for a running kernel, you must still have a source tree for that Linux release, otherwise the headers cannot be found. However, that source tree doesn't need modversions.h etc. <item> We don't actively maintain 2.0 code anymore. The 2.1 code is preferred, but not yet perfect. </itemize> <sect2> Building the userlevel code Coda builds most easily on <em/glibc/ systems. We do not actively maintain the <tt/libc/ variant anymore. You will need several libraries to link the Coda binaries, and include files from some of the lib???-devel packages. Readline and termcap are probably the most important ones. To build: <itemize> <item>Unpack <tt>coda-?.?.?.tgz</tt>. <item>./configure --prefix=/usr <item> make (make coda for versions 4.6 and older) <item> make client-install <item> make server-install </itemize> <sect1> Building on FreeBSD <sect2> Building a Whole Kernel The Coda kernel files are in the FreeBSD -current kernel sources. All you need to do to get a Coda capable kernel is to build a configuration that includes Coda. The lines to enable Coda are not in the GENERIC. There are several Coda lines in the LINT file that you need to copy. In the i386/conf directory type: <tscreen> <verb> grep -i coda LINT </verb> </tscreen> It currently gives: <tscreen> <verb> # Coda stuff: options CODA #CODA filesystem. pseudo-device vcoda 4 #coda minicache <-> venus comm. </verb> </tscreen> Add these lines (or whatever the <tt/grep/ yields) to the GENERIC kernel configuration or your standard personal configuration. You might also want to turn on DDB and BPF. I'll assume you call the new config file CODA. You type: <tscreen> <verb> config CODA </verb> </tscreen> Then <tt/cd/ to the build area and do a <tt/make/: <tscreen> <verb> cd ../../compile/CODA make </verb> </tscreen> Finally, as root install the kernel with: <tscreen> <verb> make install </verb> </tscreen> <sect2> Building an LKM Alternatively, you could just build an lkm for Coda and insert it into the kernel. To do that you: <tscreen> <verb> cd /usr/lkm/coda </verb> </tscreen> and type: <tscreen> <verb> make make install </verb> </tscreen> This will build coda_mod.o and copy it to /lkm. You load it by typing: <tscreen> <verb> modload -v -e coda_mod -o /var/run/lkm.coda /lkm/coda_mod.o </verb> </tscreen> Remember, when using an lkm: <itemize> <item> You must boot /kernel not /kernel.old or /kernel.<something> <item> There are no symbols available for debugging. </itemize> <sect2> Building Coda: The Package Way <tscreen> <verb> cd /usr/ports/net/coda_client make install cd /usr/ports/net/coda_server make install </verb> </tscreen> <bf/NOTE:/ The client and server each need about 85Meg for the work area and another 15Meg to install. They are compiled -g for debugging. <sect2> Building Coda: The Bleading Edge Way First, you need several external packages to build coda: <tt/gnu make/, <tt/gdbm/, <tt/readline/, and <tt/perl/. You can grab any of our (latest) sources from <URL URL="ftp://ftp.coda.cs.cmu.edu/pub/coda/src" NAME="ftp://ftp.coda.cs.cmu.edu/pub/coda/src" >. For this example, I'll assume that you have obtained sources for Coda release 4.6.6. <bf/NOTE:/ On our ftp site there are also beta, alpha, and less stable source releases available for the daring. Untar the source into your favorite build area. (I am assuming this is <tt>/usr/coda</tt>.) <tscreen> <verb> tar zxfv coda-4.6.6.tgz -C /usr/coda </verb> </tscreen> <tt/Cd/ to <tt>/usr/coda</tt> and <tscreen> <verb> mkdir obj </verb> </tscreen> This is where we will build the binaries. Now <tt/cd/ into <tt/obj/ and type: <tscreen> <verb> ../coda-4.6.6/configure </verb> </tscreen> This will use the gnu configure system to configure and to shadow the sources that were installed in /usr/coda/coda-4.6.6. Finally, <em/assuming you named the gnu make, gmake/, type: <tscreen> <verb> gmake coda </verb> </tscreen> To install the client suite type: <tscreen> <verb> gmake client-install </verb> </tscreen> To install the server suite type: <tscreen> <verb> gmake server-install </verb> </tscreen> <sect1> Building on NetBSD <sect2> Building a Whole Kernel The Coda kernel files are in the NetBSD -current kernel sources. All you need to do to get a Coda capable kernel is to build a configuration that includes Coda. The lines to enable Coda may not be in your configuration. They should be in GENERIC, though they may be turned off. Enable them and/or copy them to your configuration file. They should look like. <tscreen> <verb> file-system CODA # Coda File System pseudo-device vcoda 4 # coda minicache <-> venus comm. </verb> </tscreen> But whatever is in GENERIC will be more up to date. I'll assume you call the new config file CODA. You type: <tscreen> <verb> config CODA </verb> </tscreen> Then <tt/cd/ to the build area and do a <tt/make/ <tscreen> <verb> cd ../compile/CODA make </verb> </tscreen> Then just copy <tt/netbsd/ to root. <sect2> Building an LKM Alternatively, you could just build an lkm for Coda and insert it into the kernel. To do that you: <tscreen> <verb> cd /usr/sys/lkm/vfs/coda </verb> </tscreen> and type: <tscreen> <verb> make make install </verb> </tscreen> This will build coda.o and copy it to /usr/lkm. You load it by typing: <tscreen> <verb> modload -v -e coda_lkmentry -o /var/run/lkm.coda /usr/lkm/coda.o </verb> </tscreen> Remember, when using an lkm: <itemize> <item> You must boot /netbsd not /netbsd.old or /netbsd.<something> <item> There are no symbols available for debugging. </itemize> <sect2> Building Coda: The Package Way <tscreen> <verb> cd /usr/pkgsrc/net/coda_client make install cd /usr/pkgsrc/net/coda_server make install </verb> </tscreen> NOTE: The client and server each need about 85Meg for the work area and another 15Meg to install. They are compiled -g for debugging. <sect2> Building Coda: The Bleading Edge Way First, you need several external packages to build coda: <tt/gnu make/, <tt/gdbm/, <tt/readline/, and <tt/perl/. You can grab any of our (latest) sources from <URL URL="ftp://ftp.coda.cs.cmu.edu/pub/coda/src" NAME="ftp://ftp.coda.cs.cmu.edu/pub/coda/src" >. For this example, I'll assume that you have obtained sources for Coda release 4.6.6. <bf/NOTE:/ On our ftp site there are also beta, alpha, and less stable source releases available for the daring. Untar the source into your favorite build area. (I am assuming this is <tt>/usr/coda</tt>.) <tscreen> <verb> tar zxfv coda-4.6.6.tgz -C /usr/coda </verb> </tscreen> <tt/Cd/ to <tt>/usr/coda</tt> and <tscreen> <verb> mkdir obj </verb> </tscreen> This is where we will build the binaries. Now <tt/cd/ into <tt/obj/ and type: <tscreen> <verb> ../coda-4.6.6/configure </verb> </tscreen> This will use the gnu configure system to configure and to shadow the sources that were installed in /usr/coda/coda-4.6.6. Finally, <em/assuming you named the gnu make, gmake/, type: <tscreen> <verb> gmake coda </verb> </tscreen> To install the client suite type: <tscreen> <verb> gmake client-install </verb> </tscreen> To install the server suite type: <tscreen> <verb> gmake server-install </verb> </tscreen> <sect1> Building a Client on Windows 95 <bf>WARNING:</bf> The software provided to run Coda on Windows 95 is an early pre-alpha snapshot, made available to those interested. Coda runs kernel level code and priviliged processes which can cause damage to your system. Backup all data before playing with Coda. Coda for Windows was made possible through a major effort by Michael Callahan <tt>mjc@stelias.com</tt>. Michael extended the DOS protected mode environment with memory mapping and TCP/IP socket support (this involves both Windows kernel support as well as low level support in DJGPP start libraries). He also wrote a kernel module for Coda. Michael's work was further debugged, extended and packaged by the Coda team. <bf>Short rationale</bf> Why DOS applications?? It would seem more straightforward to implement the Coda client cache manager, a user level program named Venus, as a Win32 application. Sadly on Windows 95 we ran into the following (fairly well known) problem. When a user application calls a Win32 file system call, the application may acquire a mutex in a win16 system dll. The request should reach the kernel, and make its way up to Venus. Venus is then unable to service the request because it cannot grab the mutex. Deadlock results. (See Schulman's, Unauthorized Windows 95, IDG.) Implementing all of Venus as a kernel level cache manager seemed an invitation for disaster. Instead, by running the cache manager in a virtual DOS machine, as a DOS Protected Mode Interface application, one can bypass these problems, since such applications do not share the Windows dynamic libraries which gain the mutex. The price of following this path is high. There was no freely available socket support for such DPMI applications and no memory mapping support. These are provided by Michaels VXD's and standard library calls are now incorporated in the DJGPP toolchain. <sect2> Building the user level code What do you need: <enum> <item> a Linux machine with a few 100M of free space. <item> download the tool chains made available under: <URL URL="ftp://ftp.coda.cs.cmu.edu/pub/tools/" NAME="ftp://ftp.coda.cs.cmu.edu/pub/tools/{95,win32}"> <item> You need the rpms for: <itemize> <item> djgpp package. <item> djgpp-win95ext. <item> cygwin. <item> libgdbm-nt. <item> gdb-djgpp a remote debugging environment for DPMI applications. </itemize> <item>It is highly recommended to set up Samba to share Linux generated binaries with the Windows box. </enum> Get the latest tarball of Coda sources. Example: <URL URL="ftp://ftp.coda.cs.cmu.edu/pub/coda/src/" NAME="ftp://ftp.coda.cs.cmu.edu/pub/coda/src/coda-?.?.?.tgz."> You can rebuild Venus under Linux with <tscreen> <verb> ./configure --host=windows95 make </verb> </tscreen> and the client utilities and server with <tscreen> <verb> ./configure --host=nt </verb> </tscreen> The Venus built here is for NT and is not usable on Win95. <sect2> Building the kernel level code (VXDs) What do you need: <itemize> <item> Visual C++ 5.0 (with Internet Explorer). <item> MS assembler 6.11. <item> The Win95 DDK (comes with professional MSDN). <item> You also need to download a collection of header files vxdtdi.zip for networking from. <URL URL="ftp://ftp.microsoft.com/developr/drg/WinSock/MS-Extensions/VXDTDI.ZIP" NAME="ftp://ftp.microsoft.com/developr/drg/WinSock/MS-Extensions/VXDTDI.ZIP."> </itemize> <bf>Note:</bf>The 95DDK needs the VC2.0 linker, which is on the 95DDK in the directory MSVC20 (also get the files Link.err and DBI.DLL, that will be needed by the linker). You'll also need the latest MS assembler 6.11 which is on the 95DDK in the directory MASM611C. The sources for the VxD's you will build are in: <URL URL="ftp://ftp.coda.cs.cmu.edu/pub/coda/src/vc95.zip" NAME="ftp://ftp.coda.cs.cmu.edu/pub/coda/src/vc95.zip."> To build sock.vxd you'll need the header files provided by microsoft and can be found in VXDTDI.ZIP. You'll have to learn how to build VxDs, which can be done from Developer Studio: <itemize> <item> Change the path of the Linker in all .mak files <item> In the MS-Studio environment (options) add: <itemize> <item> executable path of the assembler 6.11 <item> include path of the 95DDK inc32 (Inc32) <item> include path of the 95DDK inc16 (Inc16) <item> lib path of the 95DDK (lib) <item> include path where VXDTDI Headers are <item> include path of the 95DDK (Block/Inc) </itemize> </itemize> Use the Developer Studio Project "95projects". Build the VXDs one by one. (If you fancy finishing off Michael's nice codastart project, make sure to let us know! It is not essential for now.) <sect2> Installing Coda after build <itemize> <item> Create the directories C:\usr\coda\{bin, etc, spool, venus.cache}. <item> Put Venus.exe Relay.exe, Mount.exe, Unmount.exe, sock.vxd, mc.vxd, mcstub.vxd, mmap.vxd in the directory called C:\usr\coda\bin. <item> Create the file C:\usr\coda\etc\vstab which includes a line like <tscreen> <verb>/coda:/dev/cfs0:<ip adress of server 1>,..,<ip adress of server n>,<cachesize>:1 </verb> </tscreen> Use <tscreen> <verb> /coda:/dev/cfs0:128.2.209.213:100000:1 </verb> </tscreen> to work with the CMU-testserver. <item> Create the file C:\usr\coda\venus.cache\myhost which includes your IP adress. <item> Add the two VXDs mcstub.vxd, mmap.vxd to the C:\Windows\System.ini file in the section 386Enh by typing "device=C:\usr\coda\bin\mcstub.vxd". <item> Add the path <tt>c:\usr\coda\bin</tt> to the PATH environment variable in <tt>c:\autoexec.bat</tt>. <item> Restart the computer. <item> The programs clog.exe, ctokens.exe, cunlog.exe, cfs.exe, hoard.exe, codacon.exe, venus-setup.bat will be build with the coda server for NT. </itemize> You are now at the point where the installer would have left you. To start Coda read the section <ref id="sec-win95-start-conf" name= "Windows95: Starting and Configuring a Coda client">. <sect1> Building and installing a Coda Server for NT <bf>Note:</bf> At present there is no client kernel source code available yet. We will release some very experimental binaries soon. When are a little further with it, we will release the sources, of course. You can however build a Venus (which you can with the supplied kernel Coda FSD for NT) and you can build a server. Coda has alpha support for Coda servers running on NT and Windows 95. The Win32 binaries are constructed using the Cygnus Win32 kit, which effectively translates Unix system calls to Win32 calls. We build the Win32 binaries on Linux workstations using a cross compiler, just for our convenience. At the moment this is not for the faint of heart, since quite a few things have to be done by hand. Of course we encourage playing and will try to help and fix bugs. <sect2> The build process You will need: <enum> <item> Get the cross compiling kit: <URL URL="ftp://ftp.coda.cs.cmu.edu/pub/tools/win32/cygwin-b19_glibc-2.i386.rpm" NAME="ftp://ftp.coda.cs.cmu.edu/pub/tools/win32/cygwin-b19_glibc-2.i386.rpm."> <item> <URL URL="ftp://ftp.coda.cs.cmu.edu/pub/tools/win32/libgdbm-nt-1.7.3-2.i386.rpm" NAME="ftp://ftp.coda.cs.cmu.edu/pub/tools/win32/libgdbm-nt-1.7.3-2.i386.rpm."> (These are rpms for RedHat 5.0. Sources are available of course.). <item> Get the coda-?.?.? tar ball <URL URL="ftp://ftp.coda.cs.cmu.edu/pub/coda/src" NAME="ftp://ftp.coda.cs.cmu.edu/pub/coda/src."> <item> Unpack & build <tscreen> <verb> ./configure --host=nt make coda </verb> </tscreen> <item> drink coffee, ignore or, better even, fix the compiler warnings. </enum> Native building should be possible; you'll have to make a few changes to configs/coda.m4 and create a new configs/Makeconfig.ntnative file; then run autoconf and get cracking. But don't you dare send us patches with ^M characters made by Windoze machines! <sect2>Manual configuration after building <enum> <item> Install into an area for NT binaries on your Linux box still (drink coffee, ignore or better even fix the compiler warnings) Put the binaries in a suitable place using: make BINDIR=TARGET/bin SBINDIR=TARGET/bin server-install <item> get samba running on that RH machine and export the build area for Coda. <item> get a patch to <em/select/ and replace <tscreen> <verb> C:\Cygnus\B19\H-i386-cygwin32\bin\cygwinb19.dll </verb> </tscreen> with the one from: <URL URL="ftp://ftp.coda.cs.cmu.edu/pub/coda/support/nt-patches" NAME="ftp://ftp.coda.cs.cmu.edu/pub/coda/support/nt-patches"> <item> get bash going. It can be started from the "Start" menu. <item> organise your mountpoints for Cywin. You will find the root volume mounted on <tt>c:\</tt>. Change this to <tt>c:\coda</tt> by typing: <tscreen> <verb> umount / mount -b C:\coda / </verb> </tscreen> The -b, for binary mounting is vital. <item>For your convenience add the entry <tscreen> <verb> SET HOME=C:\Coda </verb> </tscreen> in the file <tt>cygnus.bat</tt> somewhere in the Cygwin path. You can also create the file <tt>.bashrc</tt> in the <tt>c:\coda</tt> directory which gets called, when the Cygwin shell gets started. We set the export path (described below) and cd to the home directory that way. <item>Create some directories, create <tt>/bin/sh</tt> and export the path by typing: <tscreen> <verb> mkdir /bin /vice /rvm /vicepa /tmp touch /rvm/LOG touch /rvm/DATA cp //C/Cygnus/B19/H-i386-cygwin32/bin/bash /bin/sh export PATH=/bin:$PATH </verb> </tscreen> <item>Install your Coda stuff build as described above. Suppose the coda-?.?.? area is a mapped drive on the NT machine with drive letter H: type in bash: <tscreen> <verb> cd //H/ cp * /bin </verb> </tscreen> or use the Windows Explorer. The binary files might not have the extension <tt>.exe</tt>. In that case just change the name, but distinguish between bash shell scripts and binaries. </enum> You are now at the point where the installer would have left you. To configure and start the server read the section <ref id="codasrv" name="Codaserv -- the server">. </article>