"""redo-ifchange: build the given targets if they have changed."""
Further improve handling of symlink targets/deps.

In commit redo-0.11-4-g34669fb, we changed os.stat() into os.lstat() to
avoid false positives in the "manual override" detector: a .do file
that generates $3 as a symlink would trigger the manual-override
warning if the *target* of that symlink ever changed, which is
incorrect. Unfortunately, using os.lstat() leads to a different
problem: if X depends on Y and Y is a symlink to Z, then X would not be
rebuilt when Z changes, which is clearly wrong.

The fix is twofold:
1. read_stamp() should change on changes to both the link itself
   *and* the target of the link.
2. We shouldn't mark a target as overridden in so many situations.
   We'll use *only* the primary mtime of the os.lstat(), not all the
   other bits in the stamp.

Step 2 also fixes a few other false positives. For example, if you
'cp -a' a whole tree to another location, the st_ino of all the targets
will change, which would trigger a mass of "manual override" warnings.
Although a change in inode is sufficient to count an input as having
changed (just to be extra safe), it should *not* be considered a manual
override. Now we can distinguish between the two.

Because the stamp format has changed, update the SCHEMA_VER field. I
should have done this every other time I changed the stamp format, but
I forgot. Sorry. That omission leads to spurious "manually modified"
warnings after upgrading redo.
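The two-part fix can be sketched in isolation. This `read_stamp` is an illustrative stand-in for redo's real stamp function, not its actual code, and the stamp string format is invented: the point is only that the stamp reacts to changes in the link itself *and* in the link's target.

```python
import os, stat

def read_stamp(path):
    # Illustrative sketch (not redo's real read_stamp): the stamp must
    # change when the symlink itself changes *and* when its target does.
    try:
        st = os.lstat(path)              # never follows symlinks
    except OSError:
        return 'missing'
    parts = [st.st_mtime, st.st_size, st.st_ino]
    if stat.S_ISLNK(st.st_mode):
        try:
            tst = os.stat(path)          # follows the link to its target
            parts += [tst.st_mtime, tst.st_size, tst.st_ino]
        except OSError:
            parts.append('dangling')     # a broken link still gets a stamp
    return '-'.join(str(p) for p in parts)
```

For the manual-override check, only the first component (the lstat mtime) would be compared, so a changed inode or link target marks the input as changed without being mistaken for a manual edit.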
import os, sys, traceback
from . import env, builder, deps, jobserver, logs, state
from .logs import debug2, err


def should_build(t):
    f = state.File(name=t)
    if f.is_failed():
        raise builder.ImmediateReturn(32)
    dirty = deps.isdirty(f, depth='', max_changed=env.v.RUNID,
                         already_checked=[])
    # Returns (is_generated, dirty); if the only dirty dependency is the
    # target itself, report plain deps.DIRTY instead of the list.
    return f.is_generated, dirty == [f] and deps.DIRTY or dirty


def main():
    rv = 202
    try:
        targets = sys.argv[1:]
        state.init(targets)
        if env.is_toplevel and not targets:
            targets = ['all']
        if env.is_toplevel and env.v.LOG:
            builder.close_stdin()
            builder.start_stdin_log_reader(
                status=True, details=True,
                pretty=True, color=True, debug_locks=False, debug_pids=False)
        else:
Workaround for completely broken file locking on Windows 10 WSL.

WSL (Windows Subsystem for Linux) provides a Linux-kernel-compatible
ABI for userspace processes, but the current version doesn't implement
fcntl() locks at all; it just always returns success. See
https://github.com/Microsoft/WSL/issues/1927.

This causes three kinds of problems:
1. sqlite3 in WAL mode gives "OperationalError: locking protocol".
1b. Other sqlite3 journal modes also don't work when used by
    multiple processes.
2. redo parallelism doesn't work, because we can't prevent the same
   target from being built several times simultaneously.
3. "redo-log -f" doesn't work, since it can't tell whether the log
   file it's tailing is "done" or not.

To fix #1, we switch the sqlite3 journal back to PERSIST instead of
WAL. We originally changed to WAL in commit 5156feae9d to reduce
deadlocks on macOS. That was never adequately explained, but PERSIST
still acts weird on macOS, so we'll only switch to PERSIST when we
detect that locking is definitely broken. Sigh.

To (mostly) fix #2, we disable any -j value > 1 when locking is broken.
This prevents basic forms of parallelism, but doesn't stop you from
re-entrantly starting other instances of redo. To fix that properly,
we'd need to switch to a different locking mechanism entirely, which is
tough in python. flock() locks probably work, for example, but Python's
lock wrappers lie and just use fcntl locks underneath.

To fix #3, we always force --no-log mode when we find that locking is
broken.
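Detecting the broken behaviour described above might look roughly like the following. This is a hedged sketch, not redo's actual detection code; `fcntl_locks_broken` is a hypothetical helper. Since POSIX fcntl() locks are per-process, a fork is required: the parent takes an exclusive lock, then the child tries to take a conflicting one.

```python
import fcntl, os, tempfile

def fcntl_locks_broken():
    # Sketch: on a working kernel the child's non-blocking lock attempt
    # fails with EAGAIN/EACCES because the parent holds the lock; on
    # broken WSL builds, fcntl() always reports success.
    fd, path = tempfile.mkstemp()
    try:
        fcntl.lockf(fd, fcntl.LOCK_EX)
        pid = os.fork()
        if pid == 0:
            cfd = os.open(path, os.O_RDWR)
            try:
                fcntl.lockf(cfd, fcntl.LOCK_EX | fcntl.LOCK_NB)
            except OSError:
                os._exit(0)      # lock refused: locking works
            os._exit(1)          # lock granted anyway: locking is broken
        _, status = os.waitpid(pid, 0)
        return os.WEXITSTATUS(status) == 1
    finally:
        os.close(fd)
        os.unlink(path)
```

A check like this would run once at startup; only when it reports broken locking would the PERSIST journal, -j clamp, and --no-log fallbacks kick in.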
            logs.setup(
                tty=sys.stderr, parent_logs=env.v.LOG,
                pretty=env.v.PRETTY, color=env.v.COLOR)
        if env.v.TARGET and not env.v.UNLOCKED:
            me = os.path.join(env.v.STARTDIR,
                              os.path.join(env.v.PWD, env.v.TARGET))
            f = state.File(name=me)
            debug2('TARGET: %r %r %r\n'
                   % (env.v.STARTDIR, env.v.PWD, env.v.TARGET))
        else:
            f = me = None
            debug2('redo-ifchange: not adding depends.\n')
jobserver: allow overriding the parent jobserver in a subprocess.
Previously, if you passed a -j option to a redo process in a redo or
make process hierarchy with MAKEFLAGS already set, it would ignore the
-j option and continue using the jobserver provided by the parent.
With this change, we instead initialize a new jobserver with the
desired number of tokens, which is what GNU make does in the same
situation. A typical use case for this is to force serialization of
build steps in a subtree (by using -j1). In make, this is often useful
for "fixing" makefiles that haven't been written correctly for parallel
builds. In redo, that happens much less often, but it's useful at
least in unit tests.
Passing -j1 is relatively harmless (the redo you are starting inherits
a token anyway, so it doesn't create any new tokens). Passing -j > 1
is more risky, because it creates new tokens, thus increasing the level
of parallelism in the system. Because this may not be what you wanted,
we print a warning when you pass -j > 1 to a sub-redo. GNU make gives
a similar warning in this situation.
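The inherit-vs-override decision can be sketched like this. Both `jobserver_from_makeflags` and `choose_jobserver` are hypothetical helpers invented for illustration, not redo's real jobserver API; the MAKEFLAGS token-pipe syntax, however, is GNU make's documented one.

```python
import re, sys

def jobserver_from_makeflags(makeflags):
    # GNU make advertises its token pipe in MAKEFLAGS as
    # '--jobserver-auth=R,W' (make >= 4.2) or '--jobserver-fds=R,W'
    # (older versions).  Returns (rfd, wfd) or None.
    m = re.search(r'--jobserver-(?:auth|fds)=(\d+),(\d+)', makeflags or '')
    return (int(m.group(1)), int(m.group(2))) if m else None

def choose_jobserver(makeflags, j_flag):
    # If an explicit -j was given, start a fresh jobserver even when a
    # parent's exists (warning for -j > 1, since minting new tokens
    # raises total parallelism); otherwise inherit the parent's pipe.
    inherited = jobserver_from_makeflags(makeflags)
    if j_flag:
        if inherited and j_flag > 1:
            sys.stderr.write('warning: -j%d overrides inherited jobserver\n'
                             % j_flag)
        return ('new', j_flag)
    if inherited:
        return ('inherit', inherited)
    return ('new', 1)
```

With this shape, a sub-redo given `-j1` quietly serializes its own subtree, while `-j4` gets the warning because it adds tokens beyond what the parent budgeted.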
        jobserver.setup(0)
        try:
            if f:
                for t in targets:
                    f.add_dep('m', t)
                f.save()
                state.commit()
            rv = builder.run(targets, should_build)
        finally:
redo-log: prioritize the "foreground" process.

When running a parallel build, redo-log -f (which is auto-started by
redo) tries to traverse the logs depth-first, in the order in which
parent processes started their subprocesses. This works pretty well,
but a process whose dependencies are locked might have to give up its
jobserver token while other jobs build those dependencies. After a
dependency finishes, the parent might not be able to get a token back
for quite some time, and the logs will appear to stop.

To prevent this, we can instantiate up to one "cheater" token, only in
the foreground process (the one redo-log -f is following), which allows
it to continue running, albeit a bit slowly (since it has only one
token out of possibly many). When the process finishes, we destroy the
fake token. It gets a little complicated; see the explanation at the
top of jwack.py.
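The cheater-token idea, in toy form. `TokenPool` is a made-up class for this sketch, not jwack's implementation: at most one extra token may be minted, only for the foreground job, and it is destroyed before any real token is returned.

```python
class TokenPool:
    # Toy model of a jobserver pool with one optional "cheater" token.
    def __init__(self, n):
        self.free = n        # real tokens available
        self.cheats = 0      # 0 or 1 temporary fake tokens in use
    def acquire(self, foreground=False):
        if self.free > 0:
            self.free -= 1
            return True
        if foreground and self.cheats == 0:
            self.cheats = 1  # mint a single temporary token
            return True
        return False         # background jobs must wait
    def release(self):
        if self.cheats:
            self.cheats = 0  # destroy the fake token first
        else:
            self.free += 1   # return a real token to the pool
```

This keeps total real tokens conserved while guaranteeing the followed process can always make (slow) progress.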
            try:
                state.rollback()
            finally:
                try:
                    # Return jobserver tokens even if rollback failed.
                    jobserver.force_return_tokens()
                except Exception as e:  # pylint: disable=broad-except
                    traceback.print_exc(100, sys.stderr)
                    err('unexpected error: %r\n' % e)
                    rv = 1
    except KeyboardInterrupt:
        if env.is_toplevel:
            builder.await_log_reader()
        sys.exit(200)
    state.commit()
    if env.is_toplevel:
redo-log: capture and linearize the output of redo builds.
redo now saves the stderr from every .do script, for every target, into
a file in the .redo directory. That means you can look up the logs
from the most recent build of any target using the new redo-log
command, for example:
redo-log -r all
The default is to show logs non-recursively, that is, it'll show when a
target does redo-ifchange on another target, but it won't recurse into
the logs for the latter target. With -r (recursive), it does. With -u
(unchanged), it does even if redo-ifchange discovered that the target
was already up-to-date; in that case, it prints the logs of the *most
recent* time the target was generated.
With --no-details, redo-log will show only the 'redo' lines, not the
other log messages. For very noisy build systems (like recursing into
a 'make' instance) this can be helpful to get an overview of what
happened, without all the cruft.
You can use the -f (follow) option like tail -f, to follow a build
that's currently in progress until it finishes. redo itself spins up a
copy of redo-log -r -f while it runs, so you can see what's going on.
Still broken in this version:
- No man page or new tests yet.
- ANSI colors don't yet work (unless you use --raw-logs, which gives
the old-style behaviour).
- You can't redirect the output of a sub-redo to a file or a
pipe right now, because redo-log is eating it.
- The regex for matching 'redo' lines in the log is very gross.
Instead, we should put the raw log files in a more machine-parseable
format, and redo-log should turn that into human-readable format.
- redo-log tries to "linearize" the logs, which makes them
comprehensible even for a large parallel build. It recursively shows
log messages for each target in depth-first tree order (by tracing
into a new target every time it sees a 'redo' line). This works
really well, but in some specific cases, the "topmost" redo instance
can get stuck waiting for a jwack token, which makes it look like the
whole build has stalled, when really redo-log is just waiting a long
time for a particular subprocess to be able to continue. We'll need to
add a specific workaround for that.
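The depth-first linearization can be illustrated with a toy model. The `logs` dict and the `('redo', target)` markers here are invented for this sketch; redo-log actually recovers the structure by matching 'redo' lines in the raw per-target log files.

```python
def linearize(logs, target, recursive=True, seen=None):
    # Toy sketch of redo-log's interleaving: whenever a target's log
    # records that it redo'd another target, recurse into that target's
    # log before continuing, yielding one readable depth-first stream.
    seen = set() if seen is None else seen
    if target in seen:               # each target's log is shown once
        return []
    seen.add(target)
    out = []
    for line in logs.get(target, []):
        if isinstance(line, tuple) and line[0] == 'redo':
            out.append('redo %s' % line[1])
            if recursive:
                out.extend(linearize(logs, line[1], recursive, seen))
        else:
            out.append(line)
    return out
```

With `recursive=False` this degrades to the default non-recursive view: 'redo' lines are shown but sub-logs are not expanded.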
        builder.await_log_reader()
    sys.exit(rv)


if __name__ == '__main__':
    main()