This plugin uses Gearman to support multiple Jenkins masters.
Plugin Information |
---|
View Gearman on the plugin site for more information. |
The current version of this plugin may not be safe to use. Please review the following warnings before use:
Summary
We on Openstack infrastructor team use Jenkins extensively. Our jenkins servers, at peak load, runs 7000+ jobs per day. At that load we require many jenkins slaves (200+) to process all of those build jobs. We have found that our requirement was pushing Jenkins beyond it's limits therefore we've decided to create the Gearman Plugin to support multiple Jenkins masters. The gearman plugin was designed to support extra slaves, allow load balancing of build jobs, and provide redundancy.
Jenkins core does not support multiple masters. You can setup multiple Jenkins masters but there is no coordination between them. One problem with scheduling builds on Jenkins master (“MasterA”) server is that MasterA only knows about its connected slaves. If all slaves on MasterA are busy then MasterA will just put the next scheduled build on the jenkin server queue. Now MasterA needs to wait for an available slave to run the build. This will be very in-efficient if your builds take a long time to run. So.. what if there is another Jenkins master (“MasterB”) that has free slaves to service the next scheduled build on the server's queue? Your probably saying.. “Then slaves on MasterB should run the build instead of waiting for slaves on MasterA to run them”, then I would say "good thought!". However MasterB will never service the builds on MasterA's queue. The client that schedules the builds must know about MasterB and then schedule builds on MasterB. This is what we mean by lack of coordination between masters. This gearman-plugin attempts to fill the gap.
This plugin integrates Gearman with Jenkins and will make it so that any Jenkins slave on any Jenkins master can service a job in the queue. This plugin will essentially replace the Jenkins (master) build queue with the Gearman job queue. The job will stay in the Gearman queue until there is a Jenkins node (master or slave) that can run that job. The gearman job queue is shared by multiple jenkins masters therefore gearman can hand out jobs to the next available slave on any jenkins master.
Getting Started
This assumes some familiarity with Jenkins and Gearman
Install
- Install the Gearman plugin like any other Jenkins plugin, refer to the Jenkins documentation. You can also get the plugin directly from the Jenkins CI Repository
- If you don't already have a Gearman server up and running somewhere you should install one. This plugin will work with the following gearman servers:
- The python gear package (recommended). This is the one we use and test with.
- The C gearmand server ver 1.1.7 and later. To get later versions of gearman server you may need to build from source.
- Theoretically the plugin should work with any gearman server, but we've only used it with the python gear implementation.
Configuration
The Gearman plugin configuration should appear in the Jenkins global configuration page. Click on the help bubbles if you need additional help with the configuration. You should test the connection to your Gearman Server before running the workers. The 'Enable Gearman' checkbox will start the gearman workers for the Jenkins server.
Workflow
Starting the Gearman workers:
- When the gearman plugin is enabled a gearman worker thread is spawned for each executor on the master and slave nodes. We'll call these "executor worker threads". Each executor worker thread is associated 1:1 with a Jenkins executor.
- We spawn one more thread to be a Gearman worker to handle job management. We'll call it the "management worker thread" and it will register a "stop:$hostname" and "set_description:$hostname" Gearman function. We use these functions to abort and rename jenkins jobs.
- The gearman plugin will register gearman functions for each Gearman executor depending on the projects, labels and nodes that have been setup on the Jenkins master. You can check the registered gearman functions using the administration protocol. It should look something like this..
Notes:
- Red text denotes gearman admin commands
- Blue text denotes gearman workers. There is a default manager worker for the master and an executor worker for a jenkins executor on master. There are two gearman executor workers for oneiric-668599 slave (exec-0 & exec-1). These executor workers map to two jenkins executors on the oneiric-668599 slave.
- Functions like "build:guava:ubuntu" map to build:$projectName:$nodeLabel"
Here's the corresponding Jenkins:
Running a Jenkins build:
Any Gearman client (in any language) can connect to the Gearman server and send a request to build a Jenkins job. The client just needs to provide the Gearman hostname, port, function, and UUID to start a jenkins build. Below is an example in python using the python gear package..
import gear import simplejson import uuid import time #begin time print time.asctime( time.localtime(time.time()) ) # Submit job build_id = uuid.uuid4().hex jenkins_build_params = {'param1':"cat",'param2':"dog",'param3':'bird','uuid':build_id} job_name = 'build:guava' client = gear.Client() client.addServer('MyGearmanServer') job = gear.Job(job_name, simplejson.dumps(jenkins_build_params), unique=build_id) client.waitForServer() # Wait for at least one server to be connected client.submitJob(job) print "Sending job "+job_name+" with uuid="+build_id+" and params= "+str(jenkins_build_params) # wait for job to complete and show info finished = False while not finished: print "status = %s" % job.fraction_complete if (job.complete or job.failure or job.exception): finished = True print "completed = %s" % str(job.complete) print "failure = %s" % str(job.failure) print "exception = %s" % str(job.exception) print "warning = %s" % str(job.warning) print "data = %s" % str(job.data) print time.asctime( time.localtime(time.time()) ) client.shutdown() time.sleep(1);
- For java examples look in github for the Gearman StartJob client java examples.
- The Gearman workers which were spawned by the Gearman Plugin will service any client request that come through to start and abort a Jenkins build.
Stopping/Cancelling a gearman/jenkins job:
- Any Gearman client can connect to the Gearman server and send a request to stop a job.
- The job is stopped differently depending on the current state of the job. The table below explains the state, transitions and how cancellations happen.
State |
Transitions |
Cancellation |
---|---|---|
Gearman queue |
Sending a job request to gearman puts it on the gearman queue |
the job is removed from the gearman queue |
Jenkins queue |
jobs on the gearman queue will transition to the jenkins queue |
the job is removed from the Jenkins queue |
Jenkins executor |
job on the jenkins queue transition to the jenkins executor to run |
the job is aborted in jenkins |
import gear import simplejson import uuid import time #begin time print time.asctime( time.localtime(time.time()) ) # Submit job build_id = uuid.uuid4().hex jenkins_build_params = {'name':"guava",'number':"5"} job_name = 'stop:MyGearmanServer' client = gear.Client() client.addServer('MyGearmanServer') job = gear.Job(job_name, simplejson.dumps(jenkins_build_params), unique=build_id) client.waitForServer() # Wait for at least one server to be connected client.submitJob(job) print "Sending job "+job_name+" and params= "+str(jenkins_build_params) # wait for job to complete and show infofinished = False while not finished: print "status = %s" % job.fraction_complete if (job.complete or job.failure or job.exception): finished = True print "completed = %s" % str(job.complete) print "failure = %s" % str(job.failure) print "exception = %s" % str(job.exception) print "warning = %s" % str(job.warning) print "data = %s" % str(job.data) print time.asctime( time.localtime(time.time()) ) client.shutdown() time.sleep(1);
Versions
- 0.0.1 - initial release.
- 0.0.3 - added features:
- ability to cancel gearman jobs from it's queue
- ability to set jenkins job descriptions
TODO
- Dynamically update Gearman functions relative to an update to the Jenkins executors.
- This pliugin does not register functions correctly for Jenkins Matrix Projects.
Plugin In Action
Plugin In Production
The above images just show how the plugin might work in a simple case. To see the plugin used in production check out openstack jenkins servers, yes that's servers with an s:
- jenkins - we use this master to run operational jobs
- jenkins01 - we use this master to run openstack project builds
- jenkins02 - this is essentially a mirror of jenkins01.
jenkins01 and jenkins02 use this plugin which means both masters can run any jobs that are sent to gearman.