gqsub

gqsub is a package that offers the familiar POSIX qsub interface to grid jobs. It is intented to reduce the cognitive gap the exists between local cluster use, and Grid job submission. Depending on the configuration of the local cluster, it may be possible to run submission scripts unmodified on the Grid, using gqsub. More commonly, it takes a few small modifications to produce a submission script that can run on either the local cluster, or remotely on the Grid.

This gives a mechanism for introducing experienced cluster users to the Grid, without a large barrier to entry.

It depends on the gLite 3.1 or 3.2 UI packages, but can be installed by an end user, or for the whole machine. 'Installation' is a simple matter of putting the files somewhere.

With the 1.3.0 release, gqsub offers end to end (submission, monitoring and output retrival) job management. Use gqsub to submit the job, and gqstat to monitor. Once gqstat shows that a job is done, the output is back in the directory the job was launched from. There's a few tricks it uses to optimise this, but they are not required.

1.4.0 splits up the internals a bit, and allows for plugable credential systems and submission engines. The only submission engine currently supported is gLite. Credential support covers simple VOMS proxies, and the use of MyProxy servers for credential renewal. Put 'credentials = MyProxy' in your .gqsubrc file, and away you go. Also new is the /etc/gqsubrc file, for system wide options.

1.5.0 gives data handling - staging data in and out of Storage Elements. The general method of use is a generalisation of the stagein / stageout syntax. File are specified as localName@remoteLocation, where if the localName and the @ are ommited, it defaults to the same filename as the remote host. Support remoteLocation specifiations are:

Efficency is not quite optimal, but it works. That means that occasionally a file with multiple replicas might be pulled not from the closest host; if there are multiple files to be pulled from Storage Elements. Efficency improvements will occur over the next few releases

1.5.2 release fixes a problem with gqdel, and disables deep resubmission for jobs doing direct stage out to gridFTP. There is a repo at http://www.scotgrid.ac.uk/gqsub/repo/

There is a blog to cover development and other chatter

Roadmap

Key features planned are listed, along with (sometimes) an expected date for release. Release dates are not (sadly) guaranteed, but a best estimate.

There are some features that are readily identifiable as interesting, but also not clear that there is user demand for them. These are listed as 'unscheduled'. Should you find one of these particularly interesting, do let me know (s dot purdie at physics dot gla dot ac dot uk), and I'll amend the plan.

Key milestoneVersionEstimated Release dateNotes
Whole directory submission [0]1.6.0Late 2011 
Handle ARC submission1.7.0Late 2011 
Extension to handle MPI jobs1.8.0 
Direct OGSA-BES submissionUnscheduled 
Handle GRAM submissionUnscheduled 
Job sequencing directivesUnscheduledBlog post

[0]: Allow users to use explicit file staging, without worrying over the details. Ship everything in the current directory (and subdirs) to the worker node. After the job, ship back everything that's changed. Perhaps not hte most efficent, but for many users this is much simpler to handle. With thanks to Jan Just Keijser for the idea.

The HTML manual is available

Download
VersionRelease dateTarballOnline manualNotes
1.0.07 Sep 2009gqsub-1.0.0.tgzmanual 
1.1.021 Sep 2009gqsub-1.1.0.tgzmanualRemoved dependance on python32, and removed assumptions about environment variables.
1.2.024 Sep 2009gqsub-1.2.0.tgzmanual Auto return of stage out data, via GridFTP
1.3.126 Oct 2009gqsub-1.3.1.tgzmanual Auto collection of stage out data, when gqstat identifies the job is complete
1.3.216 Dec 2009gqsub-1.3.2.tgzmanual JDL escape valve, for JDL features not supported nativly
1.4.14 Feb 2010gqsub-1.4.1.tgzmanual Mutlti credentials use - allows for proper myproxy support
1.5.26 Dec 2010gqsub-1.5.2.tar.gzmanual Data mangement - stage in and out of data to SE's


Last modified Tue 12 April 2011 . View page history
Switch to HTTPS . Print View . Built with GridSite 1.5.1