Refactor the huge README.md into the more structured mkdocs.

I also cleaned up the installation section and added links to various competing redo implementations. The new README.md is basically just link to the docs on readthedocs.org, and a link to the mailing list. These docs need a *lot* more work, but this is enough of an improvement that I'll commit it anyway for now.
2018-11-16 03:13:33 -05:00 · 2018-11-16 03:13:33 -05:00 · d0607d0091
commit d0607d0091
parent bde3b5526f
11 changed files with 1587 additions and 1477 deletions
--- a/Documentation/Contributing.md
+++ b/Documentation/Contributing.md
@ -0,0 +1,57 @@
+# License
+
+My version of redo was written without ever seeing redo code by Bernstein or
+Grosskurth, so I own the entire copyright.  It's distributed under the GNU
+LGPL version 2.  You can find a copy of it in the file called LICENSE.
+
+minimal/do is in the public domain so that it's even easier
+to include inside your own projects for people who don't
+have a copy of redo.
+
+
+# How can I help?
+
+Nowadays, redo is good enough for real production use, and some people
+are using it for real work.  That said, it has
+not reached version 1.0 and there are surely still bugs.
+
+If you run into a problem, it's really helpful if you report it to the
+mailing list below (with or without subscribing first).  We really want to
+know if redo is acting weird for you.  Even if the problem turns out to be
+operator error, we can use that information to try to improve this
+documentation.
+
+Small feature additions are also welcome, but you might want to ask on the
+mailing list before you start working on it.  The code is still evolving and
+might not be the same by the time you submit your pull request.
+
+The best things you can do for redo are:
+
+- Convert your projects to using it.  Without users, no project is
+  successful.
+
+- Build new infrastructure around redo, especially things to make it easier
+  for people to get started.  For example, an automake-like tool that filled
+  in default redo build rules for common program types would probably be
+  very popular.
+
+- Tell your friends!
+
+
+# Mailing list
+
+You should join the `redo-list@googlegroups.com` mailing list.
+
+You can find the mailing list archives here:
+<http://groups.google.com/group/redo-list>
+
+It might not look like it, but you can subscribe without having a
+Google Account.  Just send a message to
+`redo-list+subscribe@googlegroups.com` (note the plus sign).
+	
+It's okay to send a message directly to the mailing list
+without subscribing first.  If you reply to someone who writes to the
+list, please leave them in the cc: list, since if they
+haven't subscribed, they won't get your reply otherwise. 
+
+
--- a/Documentation/Cookbook.md
+++ b/Documentation/Cookbook.md
@ -0,0 +1 @@
+Not written yet!  Sorry.
--- a/Documentation/FAQBasics.md
+++ b/Documentation/FAQBasics.md
@ -0,0 +1,116 @@
+# Does redo make cross-platform builds easy?
+
+A lot of build systems that try to replace make do it by trying to provide a
+lot of predefined rules.  For example, one build system I know includes
+default rules that can build C++ programs on Visual C++ or gcc,
+cross-compiled or not cross-compiled, and so on.  Other build systems are
+specific to ruby programs, or python programs, or Java or .Net programs.
+
+redo isn't like those systems; it's more like make.  It doesn't know
+anything about your system or the language your program is written in.
+
+The good news is: redo will work with *any* programming language with about
+equal difficulty.  The bad news is: you might have to fill in more details
+than you would if you just use ANT to compile a Java program.
+
+So the short version is: cross-platform builds are about equally easy in
+make and redo.  It's not any easier, but it's not any harder.
+
+It would be possible to make an automake-like or cmake-like tool that
+generates .do files for your project, just like automake generates
+Makefiles.  But that's beyond the scope of redo itself.
+
+
+# Can I set my dircolors to highlight .do files in ls output?
+
+Yes!  At first, having a bunch of .do files in each
+directory feels like a bit of a nuisance, but once you get
+used to it, it's actually pretty convenient; a simple 'ls'
+will show you which things you might want to redo in any
+given directory.
+
+Here's a chunk of my .dircolors.conf:
+
+	.do 00;35
+	*Makefile 00;35
+	.o 00;30;1
+	.pyc 00;30;1
+	*~ 00;30;1
+	.tmp 00;30;1
+
+To activate it, you can add a line like this to your .bashrc:
+
+	eval `dircolors $HOME/.dircolors.conf`
+
+
+# Do end users have to have redo installed in order to build my project?
+
+No.  We include a very short and simple shell script
+called `do` in the `minimal/` subdirectory of the redo project.  `do` is like
+`redo` (and it works with the same `*.do` scripts), except it doesn't
+understand dependencies; it just always rebuilds everything from the top.
+
+You can include `do` with your program to make it so non-users of redo can
+still build your program.  Someone who wants to hack on your program will
+probably go crazy unless they have a copy of `redo` though.
+
+Actually, `redo` itself isn't so big, so for large projects where it
+matters, you could just include it with your project.
+
+
+# Recursive make is considered harmful.  Isn't redo even *more* recursive?
+
+You probably mean [this 1997 paper](http://miller.emu.id.au/pmiller/books/rmch/)
+by Peter Miller.
+
+Yes, redo is recursive, in the sense that every target is built by its own
+`.do` file, and every `.do` file is a shell script being run recursively
+from other shell scripts, which might call back into `redo`.  In fact, it's
+even more recursive than recursive make.  There is no
+non-recursive way to use redo.
+
+However, the reason recursive make is considered harmful is that each
+instance of make has no access to the dependency information seen by the
+other instances.  Each one starts from its own Makefile, which only has a
+partial picture of what's going on; moreover, each one has to
+stat() a lot of the same files over again, leading to slowness.  That's
+the thesis of the "considered harmful" paper.
+
+It turns out that [non-recursive make should also be considered harmful](https://www.microsoft.com/en-us/research/wp-content/uploads/2016/03/hadrian.pdf).
+The problem is Makefiles aren't
+very "hygienic" or "modular"; if you're not running make recursively, then
+your one copy of make has to know *everything* about *everything* in your
+entire project.  Every variable in make is global, so every variable defined
+in *any* of your Makefiles is visible in *all* of your Makefiles.  Every
+little private function or macro is visible everywhere.  In a huge project
+made up of multiple projects from multiple vendors, that's just not okay.
+Plus, if all your Makefiles are tangled together, make has
+to read and parse the entire mess even to build the
+smallest, simplest target file, making it slow.
+
+`redo` deftly dodges both the problems of recursive make
+and the problems of non-recursive make.  First of all,
+dependency information is shared through a global persistent `.redo`
+database, which is accessed by all your `redo` instances at once. 
+Dependencies created or checked by one instance can be immediately used by
+another instance.  And there's locking to prevent two instances from
+building the same target at the same time.  So you get all the "global
+dependency" knowledge of non-recursive make.  And it's a
+binary file, so you can just grab the dependency
+information you need right now, rather than going through
+everything linearly.
+
+Also, every `.do` script is entirely hygienic and traceable; `redo`
+discourages the use of global environment variables, suggesting that you put
+settings into files (which can have timestamps and dependencies) instead. 
+So you also get all the hygiene and modularity advantages of recursive make.
+
+By the way, you can trace any `redo` build process just by reading the `.do`
+scripts from top to bottom.  Makefiles are actually a collection of "rules"
+whose order of execution is unclear; any rule might run at any time.  In a
+non-recursive Makefile setup with a bunch of included files, you end up with
+lots and lots of rules that can all be executed in a random order; tracing
+becomes impossible.  Recursive make tries to compensate for this by breaking
+the rules into subsections, but that ends up with all the "considered harmful"
+paper's complaints.  `redo` runs your scripts from top to bottom in a
+nice tree, so it's traceable no matter how many layers you have.
--- a/Documentation/FAQImpl.md
+++ b/Documentation/FAQImpl.md
@ -0,0 +1,311 @@
+# Hey, does redo even *run* on Windows?
+
+FIXME:
+Probably under cygwin.  But it hasn't been tested, so no.
+
+If I were going to port redo to Windows in a "native" way,
+I might grab the source code to a posix shell (like the
+one in MSYS) and link it directly into redo.
+
+`make` also doesn't *really* run on Windows (unless you use
+MSYS or Cygwin or something like that).  There are versions
+of make that do - like Microsoft's version - but their
+syntax is horrifically different from one vendor to
+another, so you might as well just be writing for a
+vendor-specific tool.
+
+At least redo is simple enough that, theoretically, one
+day, I can imagine it being cross platform.
+
+One interesting project that has appeared recently is
+busybox-w32 (https://github.com/pclouds/busybox-w32).  It's
+a port of busybox to win32 that includes a mostly POSIX
+shell (ash) and a bunch of standard Unix utilities.  This
+might be enough to get your redo scripts working on a win32
+platform without having to install a bunch of stuff.  But
+all of this needs more experimentation.
+
+
+# Can a *.do file itself be generated as part of the build process?
+
+Not currently.  There's nothing fundamentally preventing us from allowing
+it.  However, it seems easier to reason about your build process if you
+*aren't* auto-generating your build scripts on the fly.
+
+This might change someday.
+
+
+# How does redo store dependencies?
+
+At the toplevel of your project, redo creates a directory
+named `.redo`.  That directory contains a sqlite3 database
+with dependency information.
+
+The format of the `.redo` directory is undocumented because
+it may change at any time.  Maybe it will turn out that we
+can do something simpler than sqlite3.  If you really need to make a
+tool that pokes around in there, please ask on the mailing
+list if we can standardize something for you.
+
+
+# Isn't using sqlite3 overkill?  And un-djb-ish?
+
+Well, yes.  Sort of.  I think people underestimate how
+"lite" sqlite really is:
+
+	root root 573376 2010-10-20 09:55 /usr/lib/libsqlite3.so.0.8.6
+
+573k for a *complete* and *very fast* and *transactional*
+SQL database.  For comparison, libdb is:
+
+	root root 1256548 2008-09-13 03:23 /usr/lib/libdb-4.6.so
+
+...more than twice as big, and it doesn't even have an SQL parser in
+it!  Or if you want to be really horrified:
+
+	root root 1995612 2009-02-03 13:54 /usr/lib/libmysqlclient.so.15.0.0
+
+The mysql *client* library is two megs, and it doesn't even
+have a database server in it!  People who think SQL
+databases are automatically bloated and gross have not yet
+actually experienced the joys of sqlite.  SQL has a
+well-deserved bad reputation, but sqlite is another story
+entirely.  It's excellent, and much simpler and better
+written than you'd expect.
+
+But still, I'm pretty sure it's not very "djbish" to use a
+general-purpose database, especially one that has a *SQL
+parser* in it.  (One of the great things about redo's
+design is that it doesn't ever need to parse anything, so
+a SQL parser is a bit embarrassing.)
+
+I'm pretty sure djb never would have done it that way.
+However, I don't think we can reach the performance we want
+with dependency/build/lock information stored in plain text
+files; among other things, that results in too much
+fstat/open activity, which is slow in general, and even
+slower if you want to run on Windows.  That leads us to a
+binary database, and if the binary database isn't sqlite or
+libdb or something, that means we have to implement our own
+data structures.  Which is probably what djb would do, of
+course, but I'm just not convinced that I can do a better
+(or even a smaller) job of it than the sqlite guys did.
+
+Most of the state database stuff has been isolated in
+state.py.  If you're feeling brave, you can try to
+implement your own better state database, with or without
+sqlite.
+
+It is almost certainly possible to do it much more nicely
+than I have, so if you do, please send it in!
+
+
+# What hash algorithm does redo-stamp use?
+
+It's intentionally undocumented because you shouldn't need
+to care and it might change at any time.  But trust me,
+it's not the slow part of your build, and you'll never
+accidentally get a hash collision.
+
+
+# Why not *always* use checksum-based dependencies instead of timestamps?
+
+Some build systems keep a checksum of target files and rebuild dependents
+only when the target changes.  This is appealing in some cases; for example,
+with ./configure generating config.h, it could just go ahead and generate
+config.h; the build system would be smart enough to rebuild or not rebuild
+dependencies automatically.  This keeps build scripts simple and gets rid of
+the need for people to re-implement file comparison over and over in every
+project or for multiple files in the same project.
+
+There are disadvantages to using checksums for everything
+automatically, however:
+
+- Building stuff unnecessarily is *much* less dangerous
+  than not building stuff that should be built.  Checksums
+  aren't perfect (think of zero-byte output files); using
+  checksums will cause more builds to be skipped by
+  default, which is very dangerous.
+
+- It makes it hard to *force* things to rebuild when you
+  know you absolutely want that.  (With timestamps, you can
+  just `touch filename` to rebuild everything that depends
+  on `filename`.)
+  
+- Targets that are just used for aggregation (ie. they
+  don't produce any output of their own) would always have
+  the same checksum - the checksum of a zero-byte file -
+  which causes confusing results.
+
+- Calculating checksums for every output file adds time to
+  the build, even if you don't need that feature.
+  
+- Building stuff unnecessarily and then stamping it is
+  much slower than just not building it in the first place,
+  so for *almost* every use of redo-stamp, it's not the
+  right solution anyway.
+  
+- To steal a line from the Zen of Python: explicit is
+  better than implicit.  Making people think about when
+  they're using the stamp feature - knowing that it's slow
+  and a little annoying to do - will help people design
+  better build scripts that depend on this feature as
+  little as possible.
+  
+- djb's (as yet unreleased) version of redo doesn't
+  implement checksums, so doing that would produce an
+  incompatible implementation.  With redo-stamp and
+  redo-always being separate programs, you can simply
+  choose not to use them if you want to keep maximum
+  compatibility for the future.
+  
+- Bonus: the redo-stamp algorithm is interchangeable.  You
+  don't have to stamp the target file or the source files
+  or anything in particular; you can stamp any data you
+  want, including the output of `ls` or the content of a
+  web page.  We could never have made things like that
+  implicit anyway, so some form of explicit redo-stamp
+  would always have been needed, and then we'd have to
+  explain when to use the explicit one and when to use the
+  implicit one.
+  
+Thus, we made the decision to only use checksums for
+targets that explicitly call `redo-stamp` (see previous
+question).
+
+I suggest actually trying it out to see how it feels for
+you.  For myself, before there was redo-stamp and
+redo-always, a few types of problems (in particular,
+depending on a list of which files exist and which don't)
+were really annoying, and I definitely felt it.  Adding
+redo-stamp and redo-always work the way they do made the
+pain disappear, so I stopped changing things.
+
+
+# Why doesn't redo by default print the commands as they are run?
+
+make prints the commands it runs as it runs them.  redo doesn't, although
+you can get this behaviour with `redo -v` or `redo -x`. 
+(The difference between -v and -x is the same as it is in
+sh... because we simply forward those options onward to sh
+as it runs your .do script.)
+
+The main reason we don't do this by default is that the commands get
+pretty long
+winded (a compiler command line might be multiple lines of repeated
+gibberish) and, on large projects, it's hard to actually see the progress of
+the overall build.  Thus, make users often work hard to have make hide the
+command output in order to make the log "more readable."
+
+The reduced output is a pain with make, however, because if there's ever a
+problem, you're left wondering exactly what commands were run at what time,
+and you often have to go editing the Makefile in order to figure it out.
+
+With redo, it's much less of a problem.  By default, redo produces output
+that looks like this:
+
+	$ redo t
+	redo  t/all
+	redo    t/hello
+	redo      t/LD
+	redo      t/hello.o
+	redo        t/CC
+	redo    t/yellow
+	redo      t/yellow.o
+	redo    t/bellow
+	redo    t/c
+	redo      t/c.c
+	redo        t/c.c.c
+	redo          t/c.c.c.b
+	redo            t/c.c.c.b.b
+	redo    t/d
+
+The indentation indicates the level of recursion (deeper levels are
+dependencies of earlier levels).  The repeated word "redo" down the left
+column looks strange, but it's there for a reason, and the reason is this:
+you can cut-and-paste a line from the build script and rerun it directly.
+
+	$ redo t/c
+	redo  t/c
+	redo    t/c.c
+	redo      t/c.c.c
+	redo        t/c.c.c.b
+	redo          t/c.c.c.b.b
+
+So if you ever want to debug what happened at a particular step, you can
+choose to run only that step in verbose mode:
+
+	$ redo t/c.c.c.b.b -x
+	redo  t/c.c.c.b.b
+	* sh -ex default.b.do c.c.c.b .b c.c.c.b.b.redo2.tmp
+	+ redo-ifchange c.c.c.b.b.a
+	+ echo a-to-b
+	+ cat c.c.c.b.b.a
+	+ ./sleep 1.1
+	redo  t/c.c.c.b.b (done)
+	
+
+If you're using an autobuilder or something that logs build results for
+future examination, you should probably set it to always run redo with
+the -x option.
+
+
+# How fast is redo compared to make?
+
+FIXME:
+The current version of redo is written in python and has not been optimized. 
+So right now, it's usually a bit slower.  Not too embarrassingly slower,
+though, and the slowness mostly only strikes when you're
+building a project from scratch.
+
+For incrementally building only the changed parts of the project, redo can
+be much faster than make, because it can check all the dependencies up
+front and doesn't need to repeatedly parse and re-parse the Makefile (as
+recursive make needs to do).
+
+redo's sqlite3-based dependency database is very fast (and
+it would be even faster if we rewrite redo in C instead of
+python).  Better still, it would be possible to write an
+inotify daemon that can update the dependency database in
+real time; if you're running the daemon, you can run 'redo'
+from the toplevel and if your build is clean, it could return
+instantly, no matter how many dependencies you have.
+
+On my machine, redo can currently check about 10,000
+dependencies per second.  As an example, a program that
+depends on every single .c or .h file in the Linux kernel
+2.6.36 repo (about 36000 files) can be checked in about 4
+seconds.
+
+Rewritten in C, dependency checking would probably go about
+10 times faster still.
+
+This probably isn't too hard; the design of redo is so simple that
+it should be easy to write in any language.  It's just
+*even easier* in python, which was good for writing the
+prototype and debugging the parallelism and locking rules.
+
+Most of the slowness at the moment is because redo-ifchange
+(and also sh itself) need to be fork'd and exec'd over and
+over during the build process.
+
+As a point of reference, on my computer, I can fork-exec
+redo-ifchange.py about 87 times per second; an empty python
+program, about 100 times per second; an empty C program,
+about 1000 times per second; an empty make, about 300 times
+per second.  So if I could compile 87 files per second with
+gcc, which I can't because gcc is slower than that, then
+python overhead would be 50%.  Since gcc is slower than
+that, python overhead is generally much less - more like
+10%.
+
+Also, if you're using redo -j on a multicore machine, all
+the python forking happens in parallel with everything
+else, so that's 87 per second per core.  Nevertheless,
+that's still slower than make and should be fixed.
+
+(On the other hand, all this measurement is confounded
+because redo's more fine-grained dependencies mean you can
+have more parallelism.  So if you have a lot of CPU cores, redo
+might build *faster* than make just because it makes better
+use of them.)
--- a/Documentation/FAQInterop.md
+++ b/Documentation/FAQInterop.md
@ -0,0 +1,55 @@
+# Is redo compatible with autoconf?
+
+Yes.  You don't have to do anything special, other than the above note about
+declaring dependencies on config.h, which is no worse than what you would
+have to do with make.
+
+
+# Is redo compatible with automake?
+
+Hells no.  You can thank me later.  But see next question.
+
+
+# Is redo compatible with make?
+
+Yes.  If you have an existing Makefile (for example, in one of your
+subprojects), you can just call make from a .do script to build that
+subproject.
+
+In a file called myproject.stamp.do:
+
+	redo-ifchange $(find myproject -name '*.[ch]')
+	make -C myproject all
+
+So, to amend our answer to the previous question, you *can* use
+automake-generated Makefiles as part of your redo-based project.
+
+
+# Is redo -j compatible with make -j?
+
+Yes!  redo implements the same jobserver protocol as GNU make, which means
+that redo running under make -j, or make running under redo -j, will do the
+right thing.  Thus, it's safe to mix-and-match redo and make in a recursive
+build system.
+
+Just make sure you declare your dependencies correctly;
+redo won't know all the specific dependencies included in
+your Makefile, and make won't know your redo dependencies,
+of course.
+
+One way of cheating is to just have your make.do script
+depend on *all* the source files of a subproject, like
+this:
+
+	make -C subproject all
+	find subproject -name '*.[ch]' | xargs redo-ifchange
+
+Now if any of the .c or .h files in subproject are changed,
+your make.do will run, which calls into the subproject to
+rebuild anything that might be needed.  Worst case, if the
+dependencies are too generous, we end up calling 'make all'
+more often than necessary.  But 'make all' probably runs
+pretty fast when there's nothing to do, so that's not so
+bad.
+
+
--- a/Documentation/FAQParallel.md
+++ b/Documentation/FAQParallel.md
@ -0,0 +1,158 @@
+# Parallelism if more than one target depends on the same subdir
+
+Recursive make is especially painful when it comes to
+parallelism.  Take a look at this Makefile fragment:
+
+	all: fred bob
+	subproj:
+		touch $@.new
+		sleep 1
+		mv $@.new $@
+	fred:
+		$(MAKE) subproj
+		touch $@
+	bob:
+		$(MAKE) subproj
+		touch $@
+
+If we run it serially, it all looks good:
+
+	$ rm -f subproj fred bob; make --no-print-directory
+	make subproj
+	touch subproj.new
+	sleep 1
+	mv subproj.new subproj
+	touch fred
+	make subproj
+	make[1]: 'subproj' is up to date.
+	touch bob
+	
+But if we run it in parallel, life sucks:
+
+	$ rm -f subproj fred bob; make -j2 --no-print-directory
+	make subproj
+	make subproj
+	touch subproj.new
+	touch subproj.new
+	sleep 1
+	sleep 1
+	mv subproj.new subproj
+	mv subproj.new subproj
+	mv: cannot stat 'ubproj.new': No such file or directory
+	touch fred
+	make[1]: *** [subproj] Error 1
+	make: *** [bob] Error 2
+	
+What happened?  The sub-make that runs `subproj` ended up
+getting twice at once, because both fred and bob need to
+build it.
+
+If fred and bob had put in a *dependency* on subproj, then
+GNU make would be smart enough to only build one of them at
+a time; it can do ordering inside a single make process. 
+So this example is a bit contrived.  But imagine that fred
+and bob are two separate applications being built from the
+same toplevel Makefile, and they both depend on the library
+in subproj.  You'd run into this problem if you use
+recursive make.
+
+Of course, you might try to solve this by using
+*nonrecursive* make, but that's really hard.  What if
+subproj is a library from some other vendor?  Will you
+modify all their makefiles to fit into your nonrecursive
+makefile scheme?  Probably not.
+
+Another common workaround is to have the toplevel Makefile
+build subproj, then fred and bob.  This works, but if you
+don't run the toplevel Makefile and want to go straight
+to work in the fred project, building fred won't actually
+build subproj first, and you'll get errors.
+
+redo solves all these problems.  It maintains global locks
+across all its instances, so you're guaranteed that no two
+instances will try to build subproj at the same time.  And
+this works even if subproj is a make-based project; you
+just need a simple subproj.do that runs `make subproj`.
+
+
+# Dependency problems that only show up during parallel builds
+
+One annoying thing about parallel builds is... they do more
+things in parallel.  A very common problem in make is to
+have a Makefile rule that looks like this:
+
+	all: a b c
+	
+When you `make all`, it first builds a, then b, then c. 
+What if c depends on b?  Well, it doesn't matter when
+you're building in serial.  But with -j3, you end up
+building a, b, and c at the same time, and the build for c
+crashes.  You *should* have said:
+
+	all: a b c
+	c: b
+	b: a
+	
+and that would have fixed it.  But you forgot, and you
+don't find out until you build with exactly the wrong -j
+option.
+
+This mistake is easy to make in redo too.  But it does have
+a tool that helps you debug it: the --shuffle option. 
+--shuffle takes the dependencies of each target, and builds
+them in a random order.  So you can get parallel-like
+results without actually building in parallel.
+
+
+# What about distributed builds?
+
+FIXME:
+So far, nobody has tried redo in a distributed build environment.  It surely
+works with distcc, since that's just a distributed compiler.  But there are
+other systems that distribute more of the build process to other machines.
+
+The most interesting method I've heard of was explained (in public, this is
+not proprietary information) by someone from Google.  Apparently, the
+Android team uses a tool that mounts your entire local filesystem on a
+remote machine using FUSE and chroots into that directory.  Then you replace
+the $SHELL variable in your copy of make with one that runs this tool. 
+Because the remote filesystem is identical to yours, the build will
+certainly complete successfully.  After the $SHELL program exits, the changed
+files are sent back to your local machine.  Cleverly, the files on the
+remote server are cached based on their checksums, so files only need to be
+re-sent if they have changed since last time.  This dramatically reduces
+bandwidth usage compared to, say, distcc (which mostly just re-sends the
+same preparsed headers over and over again).
+
+At the time, he promised to open source this tool eventually.  It would be
+pretty fun to play with it.
+
+The problem:
+
+This idea won't work as easily with redo as it did with
+make.  With make, a separate copy of $SHELL is launched for
+each step of the build (and gets migrated to the remote
+machine), but make runs only on your local machine, so it
+can control parallelism and avoid building the same target
+from multiple machines, and so on.  The key to the above
+distribution mechanism is it can send files to the remote
+machine at the beginning of the $SHELL, and send them back
+when the $SHELL exits, and know that nobody cares about
+them in the meantime.  With redo, since the entire script
+runs inside a shell (and the shell might not exit until the
+very end of the build), we'd have to do the parallelism
+some other way.
+
+I'm sure it's doable, however.  One nice thing about redo
+is that the source code is so small compared to make: you
+can just rewrite it.
+
+
+# Can I convince a sub-redo or sub-make to *not* use parallel builds?
+
+Yes.  Put this in your .do script:
+
+	unset MAKEFLAGS
+	
+The child makes will then not have access to the jobserver,
+so will build serially instead.
--- a/Documentation/FAQSemantics.md
+++ b/Documentation/FAQSemantics.md
@ -0,0 +1,582 @@
+# Can I put all my rules in one big Redofile like make does?
+
+One of my favourite features of redo is that it doesn't add any new syntax;
+the syntax of redo is *exactly* the syntax of sh... because sh is the program
+interpreting your .do file.
+
+Also, it's surprisingly useful to have each build script in its own file;
+that way, you can declare a dependency on just that one build script instead
+of the entire Makefile, and you won't have to rebuild everything just
+because of a one-line Makefile change.  (Some build tools avoid that same
+problem by tracking which variables and commands were used to do the build. 
+But that's more complex, more error prone, and slower.)
+
+See djb's [Target files depend on build scripts](http://cr.yp.to/redo/honest-script.html)
+article for more information.
+
+However, if you really want to, you can simply create a
+default.do that looks something like this:
+
+	case $1 in
+		*.o) ...compile a .o file... ;;
+		myprog)  ...link a program... ;;
+		*) echo "no rule to build '$1'" >&2; exit 1 ;;
+	esac
+	
+Basically, default.do is the equivalent of a central
+Makefile in make.  As of recent versions of redo, you can
+use either a single toplevel default.do (which catches
+requests for files anywhere in the project that don't have
+their own .do files) or one per directory, or any
+combination of the above.  And you can put some of your
+targets in default.do and some of them in their own files. 
+Lay it out in whatever way makes sense to you.
+
+One more thing: if you put all your build rules in a single
+default.do, you'll soon discover that changing *anything*
+in that default.do will cause all your targets to rebuilt -
+because their .do file has changed.  This is technically
+correct, but you might find it annoying.  To work around
+it, try making your default.do look like this:
+
+	. ./default.od
+
+And then put the above case statement in default.od
+instead.  Since you didn't `redo-ifchange default.od`,
+changes to default.od won't cause everything to rebuild.
+
+
+# What are the parameters ($1, $2, $3) to a .do file?
+
+NOTE: These definitions have changed since the earliest
+(pre-0.10) versions of redo.  The new definitions match
+what djb's original redo implementation did.
+
+$1 is the name of the target file.
+
+$2 is the basename of the target, minus the extension, if
+any.
+
+$3 is the name of a temporary file that will be renamed to
+the target filename atomically if your .do file returns a
+zero (success) exit code.
+
+In a file called `chicken.a.b.c.do` that builds a file called
+`chicken.a.b.c`, $1 and $2 are `chicken.a.b.c`, and $3 is a
+temporary name like `chicken.a.b.c.tmp`.  You might have expected
+$2 to be just `chicken`, but that's not possible, because
+redo doesn't know which portion of the filename is the
+"extension."  Is it `.c`, `.b.c`, or `.a.b.c`?
+
+.do files starting with `default.` are special; they can
+build any target ending with the given extension.  So let's
+say we have a file named `default.c.do` building a file
+called `chicken.a.b.c`. $1 is `chicken.a.b.c`, $2 is `chicken.a.b`,
+and $3 is a temporary name like `chicken.a.b.c.tmp`.
+
+You should use $1 and $2 only in constructing input
+filenames and dependencies; never modify the file named by
+$1 in your script.  Only ever write to the file named by
+$3.  That way redo can guarantee proper dependency
+management and atomicity.  (For convenience, you can write
+to stdout instead of $3 if you want.)
+
+For example, you could compile a .c file into a .o file
+like this, from a script named `default.o.do`:
+	
+	redo-ifchange $2.c
+	gcc -o $3 -c $2.c
+
+
+# Why not $FILE, $BASE, $OUT instead of $1, $2, $3?
+
+That sounds tempting and easy, but one downside would be
+lack of backward compatibility with djb's original redo
+design.
+
+Longer names aren't necessarily better.  Learning the
+meanings of the three numbers doesn't take long, and over
+time, those extra few keystrokes can add up.  And remember
+that Makefiles and perl have had strange one-character
+variable names for a long time.  It's not at all clear that
+removing them is an improvement.
+
+
+# What happens to stdin/stdout/stderr?
+
+As with make, stdin is not redirected.  You're probably
+better off not using it, though, because especially with
+parallel builds, it might not do anything useful.  We might
+change this behaviour someday since it's such a terrible
+idea for .do scripts to read from stdin.
+
+As with make, stderr is also not redirected.  You can use
+it to print status messages as your build proceeds. 
+(Eventually, we might want to capture stderr so it's easier
+to look at the results of parallel builds, but this is
+tricky to do in a user-friendly way.)
+
+Redo treats stdout specially: it redirects it to point at
+$3 (see previous question).  That is, if your .do file
+writes to stdout, then the data it writes ends up in the
+output file.  Thus, a really simple `chicken.do` file that
+contains only this:
+
+    echo hello world
+
+will correctly, and atomically, generate an output file
+named `chicken` only if the echo command succeeds.
+
+
+# Isn't it confusing to capture stdout by default?
+
+Yes, it is.  It's unlike what almost any other program
+does, especially make, and it's very easy to make a
+mistake.  For example, if you write in your script:
+
+	echo "Hello world"
+	
+it will go to the target file rather than to the screen.
+
+A more common mistake is to run a program that writes to
+stdout by accident as it runs.  When you do that, you'll
+produce your target on $3, but it might be intermingled
+with junk you wrote to stdout.  redo is pretty good about
+catching this mistake, and it'll print a message like this:
+
+	redo  zot.do wrote to stdout *and* created $3.
+	redo  ...you should write status messages to stderr, not stdout.
+	redo  zot: exit code 207
+
+Despite the disadvantages, though, automatically capturing
+stdout does make certain kinds of .do scripts really
+elegant.  The "simplest possible .do file" can be very
+short.  For example, here's one that produces a sub-list
+from a list:
+
+	redo-ifchange filelist
+	grep ^src/ filelist
+	
+redo's simplicity is an attempt to capture the "Zen of
+Unix," which has a lot to do with concepts like pipelines
+and stdout.  Why should every program have to implement its
+own -o (output filename) option when the shell already has
+a redirection operator?  Maybe if redo gets more popular,
+more programs in the world will be able to be even simpler
+than they are today.
+
+By the way, if you're running some programs that might
+misbehave and write garbage to stdout instead of stderr
+(Informational/status messages always belong on stderr, not
+stdout!  Fix your programs!), then just add this line to
+the top of your .do script:
+
+	exec >&2
+	
+That will redirect your stdout to stderr, so it works more
+like you expect.
+
+
+# Run redo-ifchange in a loop?
+
+The obvious way to write a list of dependencies might be
+something like this:
+
+	for d in *.c; do
+		redo-ifchange ${d%.c}.o
+	done
+
+But it turns out that's very non-optimal.  First of all, it
+forces all your dependencies to be built in order
+(redo-ifchange doesn't return until it has finished
+building), which makes -j parallelism a lot less useful. 
+And secondly, it forks and execs redo-ifchange over and
+over, which can waste CPU time unnecessarily.
+
+A better way is something like this:
+
+	for d in *.c; do
+		echo ${d%.c}.o
+	done |
+	xargs redo-ifchange
+	
+That only runs redo-ifchange once (or maybe a few times, if
+there are really a *lot* of dependencies and xargs has to
+split it up), which saves fork/exec time and allows for
+parallelism.
+
+
+# If a target is identical after rebuilding, how do I prevent dependents from being rebuilt?
+
+For example, running ./configure creates a bunch of files including
+config.h, and config.h might or might not change from one run to the next. 
+We don't want to rebuild everything that depends on config.h if config.h is
+identical.
+
+With `make`, which makes build decisions based on timestamps, you would
+simply have the ./configure script write to config.h.new, then only
+overwrite config.h with that if the two files are different. 
+However, that's a bit tedious.
+
+With `redo`, there's an easier way.  You can have a
+config.do script that looks like this:
+
+	redo-ifchange autogen.sh *.ac
+	./autogen.sh
+	./configure
+	cat config.h configure Makefile | redo-stamp
+	
+Now any of your other .do files can depend on a target called
+`config`.  `config` gets rebuilt automatically if any of
+your autoconf input files are changed (or if someone does
+`redo config` to force it).  But because of the call to
+redo-stamp, `config` is only considered to have changed if
+the contents of config.h, configure, or Makefile are
+different than they were before.
+
+(Note that you might actually want to break this .do up into a
+few phases: for example, one that runs aclocal, one that
+runs autoconf, and one that runs ./configure.  That way
+your build can always do the minimum amount of work
+necessary.)
+
+
+# Why does 'redo target' redo even unchanged targets?
+
+When you run `make target`, make first checks the
+dependencies of target; if they've changed, then it
+rebuilds target.  Otherwise it does nothing.
+
+redo is a little different.  It splits the build into two
+steps.  `redo target` is the second step; if you run that
+at the command line, it just runs the .do file, whether it
+needs it or not.
+
+If you really want to only rebuild targets that have
+changed, you can run `redo-ifchange target` instead.
+
+The reasons I like this arrangement come down to semantics:
+
+- "make target" implies that if target exists, you're done;
+  conversely, "redo target" in English implies you really
+  want to *redo* it, not just sit around.
+  
+- If this weren't the rule, `redo` and `redo-ifchange`
+  would mean the same thing, which seems rather confusing.
+  
+- If `redo` could refuse to run a .do script, you would
+  have no easy one-line way to force a particular target to
+  be rebuilt.  You'd have to remove the target and *then*
+  redo it, which is more typing.  On the other hand, nobody
+  actually types "redo foo.o" if they honestly think foo.o
+  doesn't need rebuilding.
+  
+- For "contentless" targets like "test" or "clean", it would
+  be extremely confusing if they refused to run just
+  because they ran successfully last time.
+  
+In make, things get complicated because it doesn't
+differentiate between these two modes.  Makefile rules
+with no dependencies run every time, *unless* the target
+exists, in which case they run never, *unless* the target
+is marked ".PHONY", in which case they run every time.  But
+targets that *do* have dependencies follow totally
+different rules.  And all this is needed because there's no
+way to tell make, "Listen, I just really want you to run
+the rules for this target *right now*."
+
+With redo, the semantics are really simple to explain.  If
+your brain has already been fried by make, you might be
+surprised by it at first, but once you get used to it, it's
+really much nicer this way.
+
+
+# Can I write .do files in my favourite language, not sh?
+
+Yes.  If the first line of your .do file starts with the
+magic "#!/" sequence (eg. `#!/usr/bin/python`), then redo
+will execute your script using that particular interpreter.
+
+Note that this is slightly different from normal Unix
+execution semantics. redo never execs your script directly;
+it only looks for the "#!/" line.  The main reason for this
+is so that your .do scripts don't have to be marked
+executable (chmod +x).  Executable .do scripts would
+suggest to users that they should run them directly, and
+they shouldn't; .do scripts should always be executed
+inside an instance of redo, so that dependencies can be
+tracked correctly.
+
+WARNING: If your .do script *is* written in Unix sh, we
+recommend *not* including the `#!/bin/sh` line.  That's
+because there are many variations of /bin/sh, and not all
+of them are POSIX compliant.  redo tries pretty hard to
+find a good default shell that will be "as POSIXy as
+possible," and if you override it using #!/bin/sh, you lose
+this benefit and you'll have to worry more about
+portability.
+
+
+# Can a single .do script generate multiple outputs?
+
+FIXME: Yes, but this is a bit imperfect.
+
+For example, compiling a .java file produces a bunch of .class
+files, but exactly which files?  It depends on the content
+of the .java file.  Ideally, we would like to allow our .do
+file to compile the .java file, note which .class files
+were generated, and tell redo about it for dependency
+checking.
+
+However, this ends up being confusing; if myprog depends
+on foo.class, we know that foo.class was generated from
+bar.java only *after* bar.java has been compiled.  But how
+do you know, the first time someone asks to build myprog,
+where foo.class is supposed to come from?
+
+So we haven't thought about this enough yet.
+
+Note that it's *okay* for a .do file to produce targets
+other than the advertised one; you just have to be careful. 
+You could have a default.javac.do that runs 'javac
+$2.java', and then have your program depend on a bunch of .javac
+files.  Just be careful not to depend on the .class files
+themselves, since redo won't know how to regenerate them.
+
+This feature would also be useful, again, with ./configure:
+typically running the configure script produces several
+output files, and it would be nice to declare dependencies
+on all of them.
+
+
+# Should I use environment variables to affect my build?
+
+Directly using environment variables is a bad idea because you can't declare
+dependencies on them.  Also, if there were a file that contained a set of
+variables that all your .do scripts need to run, then `redo` would have to
+read that file every time it starts (which is frequently, since it's
+recursive), and that could get slow.
+
+Luckily, there's an alternative.  Once you get used to it, this method is
+actually much better than environment variables, because it runs faster
+*and* it's easier to debug.
+
+For example, djb often uses a computer-generated script called `compile` for
+compiling a .c file into a .o file.  To generate the `compile` script, we
+create a file called `compile.do`:
+	
+	redo-ifchange config.sh
+	. ./config.sh
+	echo "gcc -c -o \$3 \$2.c $CFLAGS" >$3
+	chmod a+x $3
+
+Then, your `default.o.do` can simply look like this:
+
+	redo-ifchange compile $2.c
+	./compile $1 $2 $3
+
+This is not only elegant, it's useful too.  With make, you have to always
+output everything it does to stdout/stderr so you can try to figure out
+exactly what it was running; because this gets noisy, some people write
+Makefiles that deliberately hide the output and print something friendlier,
+like "Compiling hello.c".  But then you have to guess what the compile
+command looked like.
+
+With redo, the command *is* `./compile hello.c`, which looks good when
+printed, but is also completely meaningful.  Because it doesn't depend on
+any environment variables, you can just run `./compile hello.c` to reproduce
+its output, or you can look inside the `compile` file to see exactly what
+command line is being used.
+
+As a bonus, all the variable expansions only need to be done once: when
+generating the ./compile program.  With make, it would be recalculating
+expansions every time it compiles a file.  Because of the
+way make does expansions as macros instead of as normal
+variables, this can be slow.
+
+
+# Example default.o.do for both C and C++ source?
+
+We can upgrade the compile.do from the previous answer to
+look something like this:
+
+        redo-ifchange config.sh
+        . ./config.sh
+        cat <<-EOF
+                [ -e "\$2.cc" ] && EXT=.cc || EXT=.c
+                gcc -o "\$3" -c "\$1\$EXT" -Wall $CFLAGS
+        EOF
+        chmod a+x "$3"
+
+Isn't it expensive to have ./compile doing this kind of test for every
+single source file?  Not really.  Remember, if you have two implicit rules
+in make:
+
+	%.o: %.cc
+		gcc ...
+
+	%.o: %.c
+		gcc ...
+		
+Then it has to do all the same checks.  Except make has even *more* implicit
+rules than that, so it ends up trying and discarding lots of possibilities
+before it actually builds your program.  Is there a %.s?  A
+%.cpp?  A %.pas?  It needs to look for *all* of them, and
+it gets slow.  The more implicit rules you have, the slower
+make gets.
+
+In redo, it's not implicit at all; you're specifying exactly how to
+decide whether it's a C program or a C++ program, and what to do in each
+case.  Plus you can share the two gcc command lines between the two rules,
+which is hard in make.  (In GNU make you can use macro functions, but the
+syntax for those is ugly.)
+
+
+# Can I just rebuild just part of a project?
+
+Absolutely!  Although `redo` runs "top down" in the sense of one .do file
+calling into all its dependencies, you can start at any point in the
+dependency tree that you want.
+
+Unlike recursive make, no matter which subdir of your project you're in when
+you start, `redo` will be able to build all the dependencies in the right
+order.
+
+Unlike non-recursive make, you don't have to jump through any strange hoops
+(like adding, in each directory, a fake Makefile that does `make -C ${TOPDIR}`
+back up to the main non-recursive Makefile).  redo just uses `filename.do`
+to build `filename`, or uses `default*.do` if the specific `filename.do`
+doesn't exist.
+
+When running any .do file, `redo` makes sure its current directory is set to
+the directory where the .do file is located.  That means you can do this:
+
+	redo ../utils/foo.o
+	
+And it will work exactly like this:
+
+	cd ../utils
+	redo foo.o
+	
+In make, if you run
+
+	make ../utils/foo.o
+	
+it means to look in ./Makefile for a rule called
+../utils/foo.o... and it probably doesn't have such a
+rule.  On the other hand, if you run
+
+	cd ../utils
+	make foo.o
+	
+it means to look in ../utils/Makefile and look for a rule
+called foo.o.  And that might do something totally
+different!  redo combines these two forms and does
+the right thing in both cases.
+
+Note: redo will always change to the directory containing
+the .do file before trying to build it.  So if you do
+
+	redo ../utils/foo.o
+
+the ../utils/default.o.do file will be run with its current directory set to
+../utils.  Thus, the .do file's runtime environment is
+always reliable.
+
+On the other hand, if you had a file called ../default.o.do,
+but there was no ../utils/default.o.do, redo would select
+../default.o.do as the best matching .do file.  It would
+then run with its current directory set to .., and tell
+default.o.do to create an output file called "utils/foo.o"
+(that is, foo.o, with a relative path explaining how to
+find foo.o when you're starting from the directory
+containing the .do file).
+
+That sounds a lot more complicated than it is.  The results
+are actually very simple: if you have a toplevel
+default.o.do, then all your .o files will be compiled with
+$PWD set to the top level, and all the .o filenames passed
+as relative paths from $PWD.  That way, if you use relative
+paths in -I and -L gcc options (for example), they will
+always be correct no matter where in the hierarchy your
+source files are.
+
+
+# Can I put my .o files in a different directory from my .c files?
+
+Yes.  There's nothing in redo that assumes anything about
+the location of your source files.  You can do all sorts of
+interesting tricks, limited only by your imagination.  For
+example, imagine that you have a toplevel default.o.do that looks
+like this:
+
+	ARCH=${1#out/}
+	ARCH=${ARCH%%/*}
+	SRC=${1#out/$ARCH/}
+	redo-ifchange $SRC.c
+	$ARCH-gcc -o $3 -c $SRC.c
+
+If you run `redo out/i586-mingw32msvc/path/to/foo.o`, then
+the above script would end up running
+
+	i586-mingw32msvc-gcc -o $3 -c path/to/foo.c
+
+You could also choose to read the compiler name or options from
+out/$ARCH/config.sh, or config.$ARCH.sh, or use any other
+arrangement you want.
+
+You could use the same technique to have separate build
+directories for out/debug, out/optimized, out/profiled, and so on.
+
+
+# Can my filenames have spaces in them?
+
+Yes, unlike with make.  For historical reasons, the Makefile syntax doesn't
+support filenames with spaces; spaces are used to separate one filename from
+the next, and there's no way to escape these spaces.
+
+Since redo just uses sh, which has working escape characters and
+quoting, it doesn't have this problem.
+
+
+# Does redo care about the differences between tabs and spaces?
+
+No.
+
+
+# What if my .c file depends on a generated .h file?
+
+This problem arises as follows.  foo.c includes config.h, and config.h is
+created by running ./configure.  The second part is easy; just write a
+config.h.do that depends on the existence of configure (which is created by
+configure.do, which probably runs autoconf).
+
+The first part, however, is not so easy.  Normally, the headers that a C
+file depends on are detected as part of the compilation process.  That works
+fine if the headers, themselves, don't need to be generated first.  But if
+you do
+
+	redo foo.o
+	
+There's no way for redo to *automatically* know that compiling foo.c
+into foo.o depends on first generating config.h.
+
+Since most .h files are *not* auto-generated, the easiest
+thing to do is probably to just add a line like this to
+your default.o.do:
+
+	redo-ifchange config.h
+	
+Sometimes a specific solution is much easier than a general
+one.
+
+If you really want to solve the general case,
+[djb has a solution for his own
+projects](http://cr.yp.to/redo/honest-nonfile.html), which is a simple
+script that looks through C files to pull out #include lines.  He assumes
+that `#include <file.h>` is a system header (thus not subject to being
+built) and `#include "file.h"` is in the current directory (thus easy to
+find).  Unfortunately this isn't really a complete
+solution, but at least it would be able to redo-ifchange a
+required header before compiling a program that requires
+that header.
--- a/Documentation/GettingStarted.md
+++ b/Documentation/GettingStarted.md
@ -0,0 +1,36 @@
+# Prerequisites
+
+Currently, this version of redo requires python2.7 and the python2.7 sqlite3 module. 
+Optional, but recommended, is the
+[setproctitle](http://code.google.com/p/py-setproctitle/) module, which makes your
+`ps` output prettier.
+
+In modern versions of Debian, sqlite3 is already part of the python2.7 package. 
+You can install the requirements like this:
+```sh
+	sudo apt-get install python2.7 python-setproctitle
+```
+(If you have install instructions for other OSes, please add them here :))
+
+
+# Clone, compile, and test redo
+
+You can run redo without installing it, like this:
+```sh
+	git clone https://github.com/apenwarr/redo
+	cd redo
+	./redo -j10 test
+```
+
+If the tests pass, you can either add $PWD/redo to your PATH, or install
+redo on your system.  To install for all users, put it in /usr/local:
+
+```sh
+	PREFIX=/usr/local sudo ./redo install
+```
+
+Or to install it just for yourself (without needing root access), put it in
+your home directory:
+```sh
+	PREFIX=$HOME ./redo install
+```
--- a/Documentation/index.md
+++ b/Documentation/index.md
@ -1 +1,244 @@
-Hello world!
+# redo: a recursive, general-purpose build system
+
+`redo` is a competitor to the long-lived, but sadly imperfect, `make`
+program.  There are many such competitors, because many people over the
+years have been dissatisfied with make's limitations.  However, of all the
+replacements I've seen, only redo captures the essential simplicity and
+flexibility of make, while avoiding its flaws.  To my great surprise, it
+manages to do this while being simultaneously simpler than make, more
+flexible than make, and more powerful than make.
+
+Although I wrote redo and I would love to take credit for it, the magical
+simplicity and flexibility comes because I copied verbatim a design by
+Daniel J. Bernstein (creator of qmail and djbdns, among many other useful
+things).  He posted some very terse notes on his web site at one point
+(there is no date) with the unassuming title, "[Rebuilding target files when
+source files have changed](http://cr.yp.to/redo.html)." Those notes are
+enough information to understand how the system is supposed to work;
+unfortunately there's no code to go with it.  I get the impression that the
+hypothetical "djb redo" is incomplete and Bernstein doesn't yet consider it
+ready for the real world.
+
+I was led to that particular page by random chance from a link on [The djb
+way](http://thedjbway.b0llix.net/future.html), by Wayne Marshall.
+
+After I found out about djb redo, I searched the Internet for any sign that
+other people had discovered what I had: a hidden, unimplemented gem of
+brilliant code design.  I found only one interesting link: Alan Grosskurth,
+whose [Master's thesis at the University of Waterloo](http://grosskurth.ca/papers/mmath-thesis.pdf)
+was about top-down software rebuilding, that is, djb redo.  He wrote his
+own (admittedly slow) implementation in about 250 lines of shell script.
+
+If you've ever thought about rewriting GNU make from scratch, the idea of
+doing it in 250 lines of shell script probably didn't occur to you.  redo is
+so simple that it's actually possible.  For testing, I actually wrote an
+even more minimal version, which always rebuilds everything instead of
+checking dependencies, in 210 lines of shell (about 4 kbytes).
+
+The design is simply that good.
+
+My implementation of redo is called `redo` for the same reason that there
+are 75 different versions of `make` that are all called `make`.  It's somehow
+easier that way.  Hopefully it will turn out to be compatible with the other
+implementations, should there be any.
+
+My extremely minimal implementation, called `do`, is in the `minimal/`
+directory of this repository.
+
+(Want to discuss redo?  See the bottom of this file for
+information about our mailing list.)
+
+
+# What's so special about redo?
+
+The theory behind redo is almost magical: it can do everything `make` can
+do, only the implementation is vastly simpler, the syntax is cleaner, and you
+can do even more flexible things without resorting to ugly hacks.  Also, you
+get all the speed of non-recursive `make` (only check dependencies once per
+run) combined with all the cleanliness of recursive `make` (you don't have
+code from one module stomping on code from another module).
+
+(Disclaimer: my current implementation is not as fast as `make` for some
+things, because it's written in python.  Eventually I'll rewrite it an C and
+it'll be very, very fast.)
+
+The easiest way to show it is with an example.
+
+Create a file called default.o.do:
+
+	redo-ifchange $2.c
+	gcc -MD -MF $2.d -c -o $3 $2.c
+	read DEPS <$2.d
+	redo-ifchange ${DEPS#*:}
+
+Create a file called myprog.do:
+
+	DEPS="a.o b.o"
+	redo-ifchange $DEPS
+	gcc -o $3 $DEPS
+	
+Of course, you'll also have to create `a.c` and `b.c`, the C language
+source files that you want to build to create your application.
+
+In a.c:
+
+	#include <stdio.h>
+	#include "b.h"
+	
+	int main() { printf(bstr); }
+	
+In b.h:
+
+	extern char *bstr;
+	
+In b.c:
+
+	char *bstr = "hello, world!\n";
+
+Now you simply run:
+
+	$ redo myprog
+	
+And it says:
+
+	redo  myprog
+	redo    a.o
+	redo    b.o
+
+Now try this:
+
+	$ touch b.h
+	$ redo myprog
+	
+Sure enough, it says:
+
+	redo  myprog
+	redo    a.o
+
+Did you catch the shell incantation in `default.o.do` where it generates
+the autodependencies?  The filename `default.o.do` means "run this script to
+generate a .o file unless there's a more specific whatever.o.do script that
+applies."
+
+The key thing to understand about redo is that declaring a dependency is just
+another shell command.  The `redo-ifchange` command means, "build each of my
+arguments.  If any of them or their dependencies ever change, then I need to
+run the *current script* over again."
+
+Dependencies are tracked in a persistent `.redo` database so that redo can
+check them later.  If a file needs to be rebuilt, it re-executes the
+`whatever.do` script and regenerates the dependencies.  If a file doesn't
+need to be rebuilt, redo can calculate that just using its persistent
+`.redo` database, without re-running the script.  And it can do that check
+just once right at the start of your project build.
+
+But best of all, as you can see in `default.o.do`, you can declare a
+dependency *after* building the program.  In C, you get your best dependency
+information by trying to actually build, since that's how you find out which
+headers you need.  redo is based on the following simple insight:
+you don't actually
+care what the dependencies are *before* you build the target; if the target
+doesn't exist, you obviously need to build it.  Then, the build script
+itself can provide the dependency information however it wants; unlike in
+`make`, you don't need a special dependency syntax at all.  You can even
+declare some of your dependencies after building, which makes C-style
+autodependencies much simpler.
+
+(GNU make supports putting some of your dependencies in include files, and
+auto-reloading those include files if they change.  But this is very
+confusing - the program flow through a Makefile is hard to trace already,
+and even harder if it restarts randomly from the beginning when a file
+changes.  With redo, you can just read the script from top to bottom.  A
+`redo-ifchange` call is like calling a function, which you can also read
+from top to bottom.)
+
+
+# What projects use redo?
+
+Here are a few open source examples:
+
+* [Liberation Circuit](https://github.com/linleyh/liberation-circuit) is a
+  straightforward example of a C++ binary (a game) compiled with redo.
+
+* [WvStreams](https://github.com/apenwarr/wvstreams) uses a more complex
+  setup producing several binaries, libraries, and scripts.  It shows how to
+  produce output files in a different directory than the source files.
+
+* [WvBuild](https://github.com/apenwarr/wvbuild) can cross-compile several
+  dependencies, like openssl and zlib, and then builds WvStreams using those
+  same libraries.  It's a good example of redo/make interop and complex
+  dependencies.
+
+* There's an experimental [variant of
+  Buildroot](https://github.com/apenwarr/buildroot/tree/redo) that uses redo
+  in order to clean up its dependency logic.
+
+* You can also find some limited examples in the
+  [`t/111-example/`](t/111-example) subdir of the redo project itself.
+
+If you switch your program's build process to use redo, please let us know and
+we can link to it here.
+
+(Please don't use the code in the `t/` directory as serious examples of how
+to use redo.  Many of the tests are doing things in deliberately psychotic
+ways in order to stress redo's code and find bugs.)
+
+
+# How does this redo compare to other redo implementations?
+
+djb never released his version, so other people have implemented their own
+variants based on his [published specification](http://cr.yp.to/redo.html).
+
+This version, sometimes called apenwarr/redo, is probably the most advanced
+one, including support for parallel builds, advanced build logs, and helpful
+debugging features.  It's currently written in python for easier
+experimentation, but the plan is to eventually migrate it to plain C.  (Some
+people like to call this version "python-redo", but I don't like that name. 
+We shouldn't have to rename it just because we port the code to C.)
+
+Here are some other redo variants (thanks to Nils Dagsson Moskopp for many
+of these links):
+
+- Alan Grosskurth's [redo thesis](http://grosskurth.ca/papers/mmath-thesis.pdf)
+  and related sh implementation.  (Arguably, this paper is the one that got
+  all the rest of us started.)
+
+- Nils Dagsson Moskopp's [redo-sh](https://web.archive.org/web/20181106195145/http://news.dieweltistgarnichtso.net/bin/redo-sh.html)
+  is a completely self-sufficient sh-based implementation.
+
+- apenwarr's [minimal/do](https://github.com/apenwarr/redo/blob/master/minimal/do)
+  is included with this copy of redo.  It's also sh-based, but intended to
+  be simple and failsafe, so it doesn't understand how to "redo" targets more
+  than once.
+
+- Christian Neukirchen's [redo-c](https://github.com/chneukirchen/redo-c), a
+  C implementation.
+
+- Jonathan de Boyne Pollard's [fork of Alan Grosskurth's redo](http://jdebp.eu./Softwares/redo/grosskurth-redo.html)
+  (another sh-based implementation).
+
+- Jonathan de Boyne Pollard's [redo](http://jdebp.eu./Softwares/redo/)
+  rewritten in C++
+
+- Gyepi Sam's [redux](https://github.com/gyepisam/redux) in Go
+
+- jekor's [redo](https://github.com/jekor/redo) in Haskell
+
+- Shanti Bouchez-Mongardé (mildred)'s [fork of apenwarr's redo](https://github.com/mildred/redo)
+  in python
+
+- Tharre's [redo](https://github.com/Tharre/redo) in C
+
+- catenate's [credo](https://github.com/catenate/credo), a (very
+  rearchitected) variant written for the Inferno Shell.
+
+The redo design is so simple and elegant that many individuals have been
+inspired to (and able to) write their own version of it.  In the honoured
+tradition of Unix's `make`, they (almost) all just use the same name,
+`redo`.  Unfortunately, many of these
+implementations are unmaintained, slightly incompatible with the "standard"
+redo semantics, and/or have few or no automated tests.
+
+At the time of this writing, none of them except apenwarr/redo (ie.  this
+project) support parallel builds (`redo -j`).  For large projects,
+parallel builds are usually essential.