1 ### Structure of queue directories ###
3 <queue>/hosts/<hostname>/ Jobs queued for the given host
4 (they are executed in the lexicographic order of <job-id>s)
5 /<job-id>.job Symlink to <queue>/jobs/<job-id>.job
6 /<job-id>.stat (Optional) status of the job
7 /<job-id>.tmp Used temporarily by brun to store the script actually
8 sent to the host (can be inspected if something goes wrong)
9 /<job-id>.log (Optional) transcript of output produced by the job (including
10 previous failed attempts)
12 <queue>/jobs/ All jobs issued on this queue, including those which
13 are no longer queued for any machine
14 /<job-id>.job Description of the job (see below)
15 /<job-id>.attach/ A directory containing attachments (if any)
17 <queue>/history/<hostname>/ Successfully completed jobs (their .job, .stat and .log files)
18 are moved here if the keep_history config switch is set.
20 <queue>/log Log of actions on this queue. Lines look this way:
21 YYYY-MM-DD HH:MM:SS <host> <job-id> <status> [<msg>]
22 <status> and <msg> correspond to "Status" and "Message"
25 <queue>/status-fifo FIFO used for reporting status of subprocesses by `bprun'
29 Mail-like structure. First come the headers (<keyword>:<spaces><value>), keywords are
30 case-sensitive, no multi-line fields allowed, then an empty line and then the body
31 (i.e., commands to be executed on the remote host).
35 ID: <job-id> Identifier of the job, unique in the scope of a queue
36 Subject: <subject> Subject to be displayed to the user
37 Prep: <command> Run <command> in a shell before the job body is executed;
38 $HOST contains the name of the host. This is useful for
39 example if you want to transfer data to the host by rsync.
41 Whenever a user command wants a job ID, it accepts any substring, as long as
46 Structure identical to job headers, but they do not contain a body.
50 Time: <timestamp> UNIX timestamp of the last status change
51 Status: <code> Machine-readable status of the job (see below)
52 Message: <msg> (Optional) human-readable message explaining the status
56 FAILED Job failed to execute (i.e., it returned a non-zero exit code)
57 INTERR Internal error of BEX (e.g., failed to read job prolog file)
58 NEW Newly inserted job, which did not run yet
59 NOPING Host does not respond to ping
60 NOXFER Transfer of the job body to a temporary file on the host has failed
61 OK Job finished successfully (this is usually not seen in the queue, since
62 finished jobs are immediately deleted or moved to the history)
63 PREP Running preparatory commands (i.e., those present in Prep header field)
64 PREPFAIL Preparatory commands failed (i.e., those present in Prep header field)
65 REMOVED Job removed from the queue (behavior similar to OK)
68 Additional status codes recorded in the log files:
70 REQUEUE Attempted to put on a queue, but it already was there
72 Additional status codes sent only over status FIFO:
74 DONE Done with the host (job equals "-")
75 INIT Host or job ready, preparing to execute jobs
76 LOCKED Host or job not available, because it is locked by another brun
77 PING Trying to ping the host (job equals "-")
78 SEND Sending job to the host