The best way to manage licenses in SGE is to use consumable resources (CR). Floating licenses
can easily be managed with a global CR; the classic example of a built-in consumable resource in SGE is
slots.
The SGE batch scheduling system allows arbitrary "consumable resources" to be created that users
can then make requests against. They can therefore be used to limit access to software licenses based on
the availability of license tokens. When a job that uses a special software package starts, it requests
one (or more) licenses from SGE, and the consumable resource bookkeeping decrements the counter for that license pool.
If no more resources are available (i.e. the internal counter is at 0), then the job is delayed
until a currently used resource is freed up.
Types of consumables
The consumable parameter can have three values:
'yes' ('y'): allowed only for numeric attributes (INT, DOUBLE, MEMORY, TIME - see type above).
'no' ('n'): the attribute is not consumable.
'JOB' ('j'): allowed only for numeric attributes (INT, DOUBLE, MEMORY, TIME - see type above).
If set to 'yes' or 'JOB', the consumption of the corresponding resource can be managed
by Sun Grid Engine internal bookkeeping. In this case Sun Grid Engine accounts for the consumption of
this resource for all running jobs and ensures that jobs are only dispatched if the internal
bookkeeping indicates enough available consumable resources. Consumables are an efficient means
to manage limited resources such as available memory, free space on a file system, network bandwidth
or floating software licenses.
There are two types of consumables: per slot and per job.
A consumable defined with 'y' is a per-slot consumable, which means the requested amount is multiplied
by the number of slots being used by the job before being applied.
With 'j' the consumable is a per-job consumable. This resource is debited as requested
(without multiplication) from the allocated master queue; the resource need not be available in
the slave task queues.
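As a sketch of the difference, suppose two hypothetical consumables have been defined: "mem" with consumable=YES (per slot) and "lic" with consumable=JOB (per job), and a job asks for 4 slots in an smp parallel environment:
% qsub -pe smp 4 -l mem=2G,lic=1 myjob.sh
# per-slot bookkeeping: mem is debited 4 x 2G = 8G
# per-job bookkeeping:  lic is debited 1 token in total, from the master queue only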
Consumables can be combined with default or user defined load parameters (see sge_conf(5) and host_conf(5)),
i.e. load values can be reported for consumable attributes or the consumable flag can be set for load
attributes.
In this case the Sun Grid Engine consumable resource management takes both the load (measuring availability
of the resource) and the internal bookkeeping into account, and makes sure that neither
of the two exceeds the given limit.
To enable consumable resource management the basic availability of a resource has to be defined.
This can be done on a cluster-global, per-host and per-queue basis, and these categories may supersede
each other in the given order (i.e. a host can restrict availability of a cluster resource, and a
queue can restrict host and cluster resources).
Defining consumables
The definition of resource availability is performed with the complex_values entry in host_conf(5)
and queue_conf(5).
Basically, a complex is a resource or value that can be requested by a
job with the -l switch to qsub. Setting a complex to be consumable
means that when a job requests that complex, the number available is decreased.
The complex_values definition of the "global" host specifies cluster
global consumable settings. To each consumable complex attribute in a complex_values list a value
is assigned which denotes the maximum available amount for that resource. The internal bookkeeping will
subtract from this total the assumed resource consumption by all running jobs as expressed through the
jobs' resource requests.
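For instance, the same consumable can be capped at several levels; a sketch with a hypothetical "verilog" pool and illustrative host/queue names:
# cluster-wide pool on the pseudo host "global" (edit the complex_values line):
% qconf -me global
   complex_values   verilog=10
# a particular host may restrict it further:
% qconf -me node01
   complex_values   verilog=2
# and a queue can restrict host and cluster resources again:
% qconf -mq all.q
   complex_values   verilog=1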
Notes:
Jobs can be forced to request a resource, and thus to specify their assumed consumption,
via the 'forced' value of the requestable parameter (see above).
A default resource consumption value can be pre-defined by the administrator
for consumable attributes not explicitly requested by the job (see the default parameter
below). This is meaningful only if requesting the attribute is not enforced as explained above.
See the Sun Grid Engine Installation and Administration Guide for examples on the usage
of the consumable resources facility.
Here is how to achieve "license token" consumption in SGE (aka license token management).
Add the consumable
(configure a "per job" consumable complex attribute):
% qconf -mc
#name shortcut type relop requestable consumable default urgency
accel accel INT <= YES JOB 0 0
add the total tokens to the "global" host
% qconf -me global
complex_values accel=19
submit job requesting slots and license tokens:
% qsub -l accel=10 -pe mpi 8 <myjob.sh>
The "per job" setting ensure that the requested tokens are *not*
multiplied with the number of requested slots.
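Had the attribute been defined with consumable=YES instead of JOB, the bookkeeping would have tried to debit 10 x 8 = 80 tokens for this job, far more than the 19 configured on the "global" host, and the job would never be dispatched.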
First create/modify a complex called "global" (the name is reserved; similarly, the complexes that
manage resources on a per-host/per-queue basis are called "host" and "queue"). This can be reached by clicking
the "Complexes Configuration" button in qmon.
Enter the following values for the complex (verilog is used in this example):
#name shortcut type value relop requestable consumable default
#-------------------------------------------------------------
verilog vl INT 0 <= YES YES 0
The above says: there is a complex attribute called "verilog" with the shortcut name "vl", and it is
of type integer. The "value" field has no meaning for consumable resources (therefore it is 0). This
resource is requestable (YES), and it is consumable (YES).
The "default" field should be set to 0 (it is a default value for jobs that don't request anything,
but for a global resource it is not useful here).
When using qmon, do not forget to press the "Add" button to add the new complex definition to the
table below before applying with the "Ok" button.
After the complex is configured, it can be viewed by running the following command at the prompt:
% qconf -sc global
Step 2: Configure the "global" host
Since a global consumable resource is being created (all hosts have access to this resource), the
pseudo host "global" must be configured.
Using qmon:
qmon -> Host Configuration -> Execution host
Select the "global" host and click on "Modify". Select the tab titled "Consumable/Fixed Attributes".
It is correct that the "global" complex does not show in the window (the global host has it by default,
just as a host has the "host" complex by default).
Now click on the "Name/Value" title bar on the right (above the trash bin icon). A window pops up
and there will be the resource "verilog". Select OK and verilog will be added to the first column of
the table. Now enter the number of licenses of verilog in the second column.
Press "Ok" and the new resource and number in the will appear in the "Consumables/Fixed Attributes"
window. Click the "Done" button to close this window.
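The same setup can also be done entirely from the command line instead of qmon; a minimal sketch (the pool size of 10 and the script name are only examples):
# add the consumable attribute (opens an editor; add a line like this):
% qconf -mc
   verilog   vl   INT   <=   YES   YES   0   0
# assign the number of available licenses to the pseudo host "global":
% qconf -me global
   complex_values   verilog=10
# users then request a token at submission time:
% qsub -l verilog=1 my_simulation.sh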
We're using SGE (Sun Grid Engine). We have some limitations on
the total number of concurrent jobs from all users.
I would like to know if it's possible to set a temporary, voluntary
limit on the number of concurrent running jobs for a specific
user.
For example, user dave is about to submit 500
jobs, but he would like no more than 100 to run concurrently,
e.g. since he knows the jobs do a lot of I/O which chokes the
filesystem (true story, unfortunately).
You can define a complex with qconf -mc. Call it
something like high_io or whatever you'd like, and
set the consumable field to YES. Then in either the
global configuration with qconf -me global or in a
particular queue with qconf -mq <queue name> set
high_io=500 in the complex values. Now tell your
users to specify -l high_io=1 or however many
"tokens" you'd like them to use. This will limit the number of
concurrent jobs to whatever you set the complex value to.
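As a sketch, the pieces fit together like this (the queue name and numbers are only examples):
# complex definition added via qconf -mc:
#   high_io   hio   INT   <=   YES   YES   0   0
# pool size set via qconf -me global (or qconf -mq all.q):
#   complex_values   high_io=500
# each job then requests tokens; with a pool of 500 and 5 tokens per job,
# at most 100 such jobs run concurrently:
% qsub -l high_io=5 myjob.sh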
The other way to do this is with quotas. Add a quota with
qconf -arqs that looks something like:
{
name dave_max_slots
description "Limit dave to 500 slots"
enabled true
limit users {dave} to slots=500
}
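Once such a resource quota set is in place, its current consumption can be checked with qquota, e.g. qquota -u dave (the output format varies between Grid Engine versions).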
Thanks Kamil, and sorry for the late reply. A couple of
follow-ups, since I'm quite new to qconf.
Regarding your first suggestion, could you be a
bit more explicit? What is "consumable"? After
configuring as mentioned, do I simply tell the
user to qsub with -l high_io=1?
– David B, Sep 28 '10 at 9:39
Basically a complex is a resource or value that can be requested by
a job with the -l switch to qsub. By setting a complex to be
consumable, it means that when a job requests
that complex the number available is decreased.
So if a queue has 500 of the high_io complex,
and a job requests 20, there will be 480
available for other jobs. You'd request the
complex just as in your example.
– Kamil Kisiel, Sep 28 '10 at 22:42
We have found that for some tasks it is advantageous to tell SGE about the required resources.
This makes sense when heavy use of RAM or network storage is expected. The limits come in
soft and hard variants (parameters -soft, -hard); the limits themselves are specified as:
-l resource=value
For example, if a job needs at least 400MB of RAM: qsub -l ram_free=400M my_script.sh. Another
often requested resource is space in /tmp: qsub -l tmp_free=10G my_script.sh. Or both:
qsub -l ram_free=400M,tmp_free=10G my_script.sh
Of course, it is possible (and preferable if the number does not change) to use the construction
#$ -l ram_free=400M directly in the script. The actual status of a given resource on all nodes can
be obtained with qstat -F ram_free, or for several resources with qstat -F ram_free,tmp_free.
Details on other standard available resources are in /usr/local/share/SGE/doc/load_parameters.asc.
If you do not specify a value for a given resource, a default value is used (1GB for space in /tmp,
100MB for RAM).
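Putting the requests directly into a job script might look like this (a sketch; the script name and values are only examples):
#!/bin/bash
#$ -N my_experiment
#$ -l ram_free=400M,tmp_free=10G
# work in an explicitly chosen directory (see the note about -cwd below)
cd /where/do/i/want
./run_experiment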
WARNING: You need to distinguish between requesting resources
that must be available at the time of submission (so-called non-consumable resources) and allocating
a given resource for the whole runtime of your computation - for example, your program
will need 400MB of memory, but in the first 10 minutes of computation it will allocate only 100MB. If
you use the standard resource mem_free and other jobs are submitted
to the given node during those first 10 minutes, SGE will interpret it in the following way: you wanted 400MB but you actually
use only 100MB, so the remaining 300MB can be given to someone else (i.e. it will dispatch another
task requesting this memory).
For these purposes it is better to use consumable resources, which are accounted independently
of the current status of the task - for memory it is ram_free, for disk tmp_free.
For example, the resource ram_free does not look at the actual free RAM; it computes the
occupation of RAM based solely on the requests of the individual scripts. It takes the size of RAM
of the given machine and subtracts the amounts requested by the jobs scheduled to run on this machine.
If a job does not specify ram_free, the default value of ram_free=100M will
be used.
For the disk space in /tmp (tmp_free), the situation is trickier: if a job does not properly clean
up its mess after it finishes, the disk can actually have less free space than indicated by the
resource. Unfortunately, nothing can be done about this.
Known problems with SGE
Use of paths - for the home directory it is necessary to use the official path, i.e. /homes/kazi/...
or /homes/eva (or simply the variable $HOME). If the path of the internal mount point of
the automounter is used, i.e. /var/mnt/..., an error will occur. (This is not an error of
SGE; the internal path is not fully functional for access.)
Availability of nodes - due to the existence of nodes with limited access (employees' PCs),
it is necessary to specify a list of nodes on which your job can run. This can be done using
the parameter -q. The machines that are generally available are the nodes in IBM Blades and also some
computer labs, provided you turn the machines on over night. The list of queues for -q
must be on a single line, even if it is very long. To select given groups of nodes,
the parameter -q can be used in the following way:
#$ -q all.q@@blade,all.q@@PCNxxx,all.q@@servers
Main groups of computers are: @blade, @servers, @speech, @PCNxxx, @PCN2xxx - the full and current
list can be obtained with qconf -shgrpl
The syntax for access is QUEUE@OBJECT, i.e. all.q@OBJECT. The object is either a single computer,
for example all.q@svatava, or a group of computers (whose name also begins with @, e.g. @blade), i.e. all.q@@blade.
The computers in the labs are sometimes restarted by students during a computation - we can't
do much about this. If you really need the computation to finish (i.e. it is not easy to
re-run a job if it is brutally killed), use the newly defined groups of computers:
@stable - @blade, @servers - servers that run all the time without restarting
@PCOxxx, @PCNxxx - computer labs; any node might be restarted at any time, and a student or someone
can shut a machine down by error or "by error". It is more or less certain that these
machines will run smoothly over night and during weekends. There is also a group for each individual lab, e.g. @PCN103.
Running scripts other than bash - it is necessary to specify the interpreter on the first
line of your script (it is probably already there), for example #!/usr/bin/perl, etc.
Does your script generate heavy traffic on the matylda servers? Then it is necessary to set -l matyldaX=10
(for example 10, i.e. at most 100/10 = 10 concurrent jobs on the given matyldaX), where X is
the number of the matylda used (if you use several matyldas, specify -l matyldaX=Y
several times). We have created an SGE resource for each matylda (each matylda has 100 points
in total), and jobs using -l matyldaX=Y are dispatched as long as the given matylda has free
points. This can be used to balance the load of a given storage server from the user side. The
same holds for the scratch0X servers.
Pay attention to the parameter -cwd: it is not guaranteed to work all the time;
it is better to use cd /where/do/i/want at the beginning of your script.
If a node is restarted, a job will still be shown in SGE although it is not running
any more. This is because SGE waits until the node confirms termination of the computation
(i.e. until it boots Linux again and starts the SGE client). If you use qdel to
delete such a job, it will only be marked with the flag d. Jobs marked with this flag are automatically
deleted by the server every hour.
Parallel jobs - OpenMP
For parallel tasks with threads, it is enough to use the parallel environment smp and to
set the number of threads accordingly.
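A sketch of such a job script (4 threads is only an example; the smp environment must exist on the cluster):
#!/bin/bash
#$ -N omp_job
#$ -pe smp 4
# use the granted slot count as the OpenMP thread count
export OMP_NUM_THREADS=$NSLOTS
./my_openmp_program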
Open MPI is now fully supported, and it is the default parallel environment (mpirun
is Open MPI's by default).
The SGE parallel environment is openmpi.
The allocation rule is $fill_up, which means that the preferred allocation is on the same
machine.
Open MPI is compiled with tight SGE integration:
mpirun will automatically start processes on the machines reserved by SGE
qdel will automatically clean up all MPI stubs
In the parallel task, do not forget (preferably directly in the script) to use the parameter
-R y; this turns on the reservation of slots, i.e. your job won't be jumped by jobs
requesting fewer slots.
If a parallel task is launched using qlogin, there is no variable containing
information on which slots were reserved. A useful tool is then qstat -u `whoami` -g t |
grep QLOGIN, which shows which parallel jobs are running.
Listing follows:
#!/bin/bash
# ---------------------------
# our name
#$ -N MPI_Job
#
# use reservation to stop starvation
#$ -R y
#
# pe request
#$ -pe openmpi 2-4
#
# ---------------------------
#
# $NSLOTS
# the number of tasks to be used
echo "Got $NSLOTS slots."
mpirun -n $NSLOTS /full/path/to/your/executable
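The script is then submitted as usual with qsub; SGE picks a slot count in the requested 2-4 range and exports it in $NSLOTS, which is passed on to mpirun.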
Added by John Pormann, Jul 16, 2008
The SGE batch scheduling system allows for arbitrary "consumable resources" to be created that
users can then make requests against. In general, this is used to limit access to a pool of software
licenses or make sure that memory usage is planned for properly. E.g. when a user wants to use a
special software package, they request 1 license from SGE and it will decrement its internal counter
for that license pool. If no more resources are available (i.e. the internal counter is at 0), then
the job will be delayed until a currently-used resource is freed up.
We can also create arbitrary consumable resources to help users self-limit their usage of the
DSCR. We can set up a resource, or counter, that will be decremented every time you submit a job.
This way, you can submit 1000's of jobs to SGE, but you won't be swamping the machines or otherwise
impeding other users.
If a user is given their own job-control resource, say 'cpus_user001', they should then submit
jobs with an extra resource request using the '-l' option:
% qsub -l cpus_user001=1 myjob.q
Before running the job, SGE will make sure that there are sufficient resources. Thus, if there are
100 resources set aside for 'cpus_user001', then the 101st simultaneous job-request will have to
wait for one of the previous jobs to complete, even if there are empty machines in the cluster.
Alternately, you can embed this within your SGE submission script by placing the corresponding #$
directive at the top of the file ("myjob.q" in the above example).
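A sketch of what that looks like (only the directive line matters here; the rest of the script is whatever the job normally does):
#!/bin/bash
#$ -l cpus_user001=1
# ... the rest of the job script follows as usual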
I am using a tool called starcluster http://star.mit.edu/cluster
to boot up an SGE-configured cluster in the Amazon cloud. The problem is that it doesn't seem to
be configured with any pre-set consumable resources, except for SLOTS, which I don't seem to be
able to request directly with qsub -l slots=X. Each time I boot up a cluster, I may
ask for a different type of EC2 node, so the fact that this slot resource is preconfigured is really
nice. I can request a certain number of slots using a pre-configured parallel environment, but the
problem is that it was set up for MPI, so requesting slots using that parallel environment sometimes
grants the job slots spread out across several compute nodes.
Is there a way to either 1) make a parallel environment that takes advantage of the existing
pre-configured HOST=X slots settings that starcluster sets up, so that you can request slots on
a single node, or 2) use some kind of resource that SGE is automatically aware of? Running
qhost makes me think that even though NCPU and MEMTOT
are not defined anywhere I can see, SGE is somehow aware of those resources. Are there settings
where I can make those resources requestable without explicitly defining how much of each is available?
The qconf -sc output is:
#name shortcut type relop requestable consumable default urgency
#----------------------------------------------------------------------------------------
arch a RESTRING == YES NO NONE 0
calendar c RESTRING == YES NO NONE 0
cpu cpu DOUBLE >= YES NO 0 0
display_win_gui dwg BOOL == YES NO 0 0
h_core h_core MEMORY <= YES NO 0 0
h_cpu h_cpu TIME <= YES NO 0:0:0 0
h_data h_data MEMORY <= YES NO 0 0
h_fsize h_fsize MEMORY <= YES NO 0 0
h_rss h_rss MEMORY <= YES NO 0 0
h_rt h_rt TIME <= YES NO 0:0:0 0
h_stack h_stack MEMORY <= YES NO 0 0
h_vmem h_vmem MEMORY <= YES NO 0 0
hostname h HOST == YES NO NONE 0
load_avg la DOUBLE >= NO NO 0 0
load_long ll DOUBLE >= NO NO 0 0
load_medium lm DOUBLE >= NO NO 0 0
load_short ls DOUBLE >= NO NO 0 0
m_core core INT <= YES NO 0 0
m_socket socket INT <= YES NO 0 0
m_topology topo RESTRING == YES NO NONE 0
m_topology_inuse utopo RESTRING == YES NO NONE 0
mem_free mf MEMORY <= YES NO 0 0
mem_total mt MEMORY <= YES NO 0 0
mem_used mu MEMORY >= YES NO 0 0
min_cpu_interval mci TIME <= NO NO 0:0:0 0
np_load_avg nla DOUBLE >= NO NO 0 0
np_load_long nll DOUBLE >= NO NO 0 0
np_load_medium nlm DOUBLE >= NO NO 0 0
np_load_short nls DOUBLE >= NO NO 0 0
num_proc p INT == YES NO 0 0
qname q RESTRING == YES NO NONE 0
rerun re BOOL == NO NO 0 0
s_core s_core MEMORY <= YES NO 0 0
s_cpu s_cpu TIME <= YES NO 0:0:0 0
s_data s_data MEMORY <= YES NO 0 0
s_fsize s_fsize MEMORY <= YES NO 0 0
s_rss s_rss MEMORY <= YES NO 0 0
s_rt s_rt TIME <= YES NO 0:0:0 0
s_stack s_stack MEMORY <= YES NO 0 0
s_vmem s_vmem MEMORY <= YES NO 0 0
seq_no seq INT == NO NO 0 0
slots s INT <= YES YES 1 1000
swap_free sf MEMORY <= YES NO 0 0
swap_rate sr MEMORY >= YES NO 0 0
swap_rsvd srsv MEMORY >= YES NO 0 0
The solution I found is to make a new parallel environment that has the $pe_slots
allocation rule (see man sge_pe). I set the number of slots available to that parallel
environment equal to the maximum, since $pe_slots limits the slot usage to a single node.
Since starcluster sets up the slots at cluster boot-up time, this seems to do the trick nicely. You
also need to add the new parallel environment to the queue. So, to make this dead simple:
qconf -ap by_node
and edit the pe configuration in the editor that comes up.
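A plausible sketch of such a pe configuration for this purpose (the slot count is illustrative; the key setting is allocation_rule, see man sge_pe):
pe_name            by_node
slots              9999
user_lists         NONE
xuser_lists        NONE
start_proc_args    /bin/true
stop_proc_args     /bin/true
allocation_rule    $pe_slots
control_slaves     TRUE
job_is_first_task  TRUE
urgency_slots      min
accounting_summary FALSE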
Also modify the queue (called all.q by starcluster) to add this new parallel environment
to the list.
qconf -mq all.q
and change this line:
pe_list make orte
to this:
pe_list make orte by_node
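With that in place, a job that needs all of its slots on one node can be submitted along these lines (the slot count and script name are only examples):
% qsub -pe by_node 2 myjob.sh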
I was concerned that jobs spawned from a given job would be limited to a single node, but this
doesn't seem to be the case. I have a cluster with two nodes with two slots each.
After a little while, qstat shows both test jobs running on different nodes:
job-ID prior name user state submit/start at queue slots ja-task-ID
-----------------------------------------------------------------------------------------------------------------
25 0.55500 test root r 10/17/2012 21:42:57 all.q@master 2
26 0.55500 sleep root r 10/17/2012 21:43:12 all.q@node001 2
I also tested submitting 3 jobs at once requesting the same number of slots on a single node,
and only two run at a time, one per node. So this seems to be properly set up!
We have a cluster of machines, each with 4 GPUs. Each job should be able to ask for 1-4 GPUs. Here's
the catch: I would like SGE to tell each job which GPU(s) it should take. Unlike the
CPU, a GPU works best if only one process accesses it at a time. So I would like:
Job #1: GPUs 0, 1, 3
Job #2: GPU 2
Job #4: wait until 1-4 GPUs are available
The problem I've run into is that SGE will let me create a GPU resource with 4 units on
each node, but it won't explicitly tell a job which GPU to use (only that it gets 1, or 3, or whatever).
I thought of creating 4 resources (gpu0, gpu1, gpu2, gpu3), but I am not sure if the
-l flag will take a glob pattern, and I can't figure out how SGE would tell the job
which gpu resources it received. Any ideas?
– Daniel Blezek
When you have multiple GPUs and you want your jobs to request a GPU, but the Grid Engine scheduler
should handle and select the free GPUs, you can configure an RSMAP (resource map) complex (instead
of an INT). This allows you to specify the amount as well as the names of the GPUs on a specific
host in the host configuration. You can also set it up as a HOST consumable, so that independent
of the number of slots you request, the amount of GPU devices requested with -l gpu=2 is 2 for each host
(even if the parallel job got, say, 8 slots on different hosts).
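The exact configuration is Univa-specific; a sketch of what it might look like (the complex and device names are illustrative):
# qconf -mc entry: an RSMAP complex consumed per HOST
#   gpu   gpu   RSMAP   <=   YES   HOST   0   0
# qconf -me <hostname>: two named GPU devices on this host
#   complex_values   gpu=2(GPU1 GPU2)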
Then, when requesting -l gpu=1, the Univa Grid Engine scheduler will select GPU2 if GPU1 is already
used by a different job. You can see the actual selection in the qstat -j output. The job learns the
selected GPU by reading the $SGE_HGR_gpu environment variable, which in this case contains the
chosen id/name "GPU2". This can be used to access the right GPU without collisions.
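On the job side, a sketch of how the granted device could be picked up (assuming the devices were named after their CUDA indices, e.g. 0 1 2 3, rather than "GPU1 GPU2"):
#!/bin/bash
#$ -l gpu=1
# Univa Grid Engine exports the granted resource id(s) in $SGE_HGR_gpu
echo "Granted GPU(s): $SGE_HGR_gpu"
export CUDA_VISIBLE_DEVICES=$SGE_HGR_gpu
./my_cuda_program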
If you have a multi-socket host, you can even attach a GPU directly to the CPU cores near the
GPU (near the PCIe bus) in order to speed up communication between GPU and CPUs. This is possible
by attaching a topology mask in the execution host configuration.
Now when the UGE scheduler selects GPU2, it automatically binds the job to all 4 cores (C) of
the second socket (S), so that the job is not allowed to run on the first socket. This does not even
require the -binding qsub parameter.
Note that all these features are only available in Univa Grid Engine (8.1.0/8.1.3 and higher),
and not in SGE 6.2u5 and other Grid Engine versions (like OGE, Son of Grid Engine, etc.). You can
try it out by downloading the 48-core limited free version from univa.com.