gqsub

gqsub is a package that offers the familiar POSIX qsub interface to Grid jobs. It is intended to reduce the cognitive gap that exists between local cluster use and Grid job submission. Depending on the configuration of the local cluster, it may be possible to run submission scripts unmodified on the Grid using gqsub. More commonly, a few small modifications are needed to produce a submission script that can run either on the local cluster or remotely on the Grid.

This gives a mechanism for introducing experienced cluster users to the Grid, without a large barrier to entry.

It depends on the gLite 3.1 or 3.2 UI packages, but can be installed by an end user, or for the whole machine. 'Installation' is a simple matter of putting the files somewhere.

With the 1.3.0 release, gqsub offers end-to-end (submission, monitoring and output retrieval) job management. Use gqsub to submit the job, and gqstat to monitor it. Once gqstat shows that a job is done, the output is back in the directory the job was launched from. There are a few tricks it uses to optimise this, but they are not required.

1.3.2 lets you specify two things for direct use of JDL: --cerequirements, which is ANDed (&&) with any other generated CERequirements, and --jdl, which should be a complete JDL line (the terminating semicolon may be omitted) and is dropped in verbatim. This allows for things like selecting (or excluding) a CREAM CE with CERequirements, or using --jdl MyProxyServer="lcgrbp01.gridpp.rl.ac.uk". Note that the string should not be escaped and should not have spaces in it.
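
As a rough illustration of the quoting rules above, here is a minimal Python sketch that assembles a gqsub command line as an argument list, so that no shell escaping is involved. The helper name build_gqsub_args and the script name job.sh are hypothetical; only the --cerequirements and --jdl options come from the description above. Applying the no-spaces check to both values is a cautious reading of the note, not documented behaviour.

```python
def build_gqsub_args(script, cerequirements=None, jdl=None):
    """Return an argument list for gqsub (hypothetical helper, not part of gqsub)."""
    args = ["gqsub"]
    for flag, value in (("--cerequirements", cerequirements), ("--jdl", jdl)):
        if value is None:
            continue
        # The page says the string should not contain spaces; reject any whitespace.
        if any(ch.isspace() for ch in value):
            raise ValueError(f"{flag} value must not contain spaces: {value!r}")
        args += [flag, value]
    # Assumption: as with POSIX qsub, the script path follows the options.
    args.append(script)
    return args


if __name__ == "__main__":
    print(build_gqsub_args("job.sh", jdl='MyProxyServer="lcgrbp01.gridpp.rl.ac.uk"'))
```

The resulting list could be handed to subprocess.run() on a machine where gqsub is installed. Passing a list rather than a single string means the quotes in the JDL line reach gqsub untouched.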

1.4.0 splits up the internals a bit, and allows for pluggable credential systems and submission engines. The only submission engine currently supported is gLite. Credential support covers simple VOMS proxies, and the use of MyProxy servers for credential renewal. Put 'credentials = MyProxy' in your .gqsubrc file, and away you go. Also new is the /etc/gqsubrc file, for system-wide options.
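
For anyone who would rather script that configuration change, the following Python sketch sets an option in a gqsubrc-style file without duplicating an existing entry. It assumes the file is a plain list of 'key = value' lines, which is inferred only from the single example above; the function name set_gqsubrc_option is hypothetical, and the file path is left to the caller because the exact location of .gqsubrc is not stated here.

```python
from pathlib import Path


def set_gqsubrc_option(rc_path, key, value):
    """Write 'key = value' into the file, replacing any existing entry for key.

    Assumes a simple 'key = value' line format (an assumption, not a
    documented gqsub file format). Returns the lines that were written.
    """
    path = Path(rc_path)
    lines = path.read_text().splitlines() if path.exists() else []
    new_line = f"{key} = {value}"
    out = []
    replaced = False
    for line in lines:
        name = line.split("=", 1)[0].strip()
        if "=" in line and name == key:
            # Keep only one entry for this key, at its first position.
            if not replaced:
                out.append(new_line)
                replaced = True
        else:
            out.append(line)
    if not replaced:
        out.append(new_line)
    path.write_text("\n".join(out) + "\n")
    return out
```

Calling set_gqsubrc_option(path_to_rc, "credentials", "MyProxy") twice leaves a single 'credentials = MyProxy' line, so the helper is safe to rerun.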

There is a blog covering development and other chatter.

Roadmap

Planned key features are listed below, along with (sometimes) an expected release date. Release dates are, sadly, not guaranteed; they are best estimates.

Some features are readily identifiable as interesting, but it is not clear that there is user demand for them. These are listed as 'unscheduled'. Should you find one of these particularly interesting, do let me know (s dot purdie at physics dot gla dot ac dot uk), and I'll amend the plan.

Key milestone                   Version      Estimated release date
Data management interface       1.5.0        Mar 2010
Whole directory submission [0]  1.6.0        Apr 2010
Handle ARC submission           1.7.0        May 2010
Extension to handle MPI jobs    1.8.0
Direct OGSA-BES submission      Unscheduled
Handle GRAM submission          Unscheduled

[0]: Allow users to use explicit file staging, without worrying over the details. Ship everything in the current directory (and subdirectories) to the worker node. After the job, ship back everything that has changed. Perhaps not the most efficient approach, but for many users this is much simpler to handle. With thanks to Jan Just Keijser for the idea.
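
To make the "ship back everything that has changed" idea concrete, here is a small Python sketch of one way it could be done: take a content-hash snapshot of the directory before the job and another afterwards, then compare them. This is purely illustrative and is not how gqsub implements (or will implement) the feature; the function names snapshot and changed_files are made up for this example.

```python
import hashlib
from pathlib import Path


def snapshot(root):
    """Map each file path (relative to root) to a SHA-256 digest of its contents."""
    root = Path(root)
    return {
        p.relative_to(root).as_posix(): hashlib.sha256(p.read_bytes()).hexdigest()
        for p in root.rglob("*")
        if p.is_file()
    }


def changed_files(before, after):
    """Return sorted paths that are new, or whose contents differ, in 'after'."""
    return sorted(
        path for path, digest in after.items() if before.get(path) != digest
    )
```

Taking snapshot(".") before the job starts and again once it finishes, then passing both to changed_files, gives the list of files worth returning. Hashing every file is simple but reads the whole directory twice, which matches the note above that this approach favours simplicity over efficiency.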

The HTML manual is available.

Download
Version  Release date  Tarball          Online manual  Notes
1.0.0    7 Sep 2009    gqsub-1.0.0.tgz  manual
1.1.0    21 Sep 2009   gqsub-1.1.0.tgz  manual         Removed dependence on python32, and removed assumptions about environment variables
1.2.0    24 Sep 2009   gqsub-1.2.0.tgz  manual         Auto return of stage-out data, via GridFTP
1.3.1    26 Oct 2009   gqsub-1.3.1.tgz  manual         Auto collection of stage-out data, when gqstat identifies the job is complete
1.3.2    16 Dec 2009   gqsub-1.3.2.tgz  manual         JDL escape valve, for JDL features not supported natively
1.4.1    4 Feb 2010    gqsub-1.4.1.tgz  manual         Multi credentials use - allows for proper MyProxy support


Last modified Fri 5 February 2010