For my job I've been investigating clustering software. We have some jobs that use up to 4Gb of RAM and can take weeks to run (though most are much smaller), we want to build a farm which can efficiently utilize hardware resources of a set server pool and developer workstations.

My current ideal list of requirements is below:

Here is the list of software I've investigated to various degrees.

Open Source Applications

Commercial Applications

Alfred

(SgiIrix, RedhatLinx) Written by Pixel, a specalized for film renders. Has some problems scaling to a very large number of hosts.

Platform LSF

(SunSolaris, MicrosoftWindows, SgiIrix, RedhatLinux, DebianLinux, more ...) The uber scheduling program that seems to exist everywhere. Very queue centric and strangly inflexible in some ways.

http://www.platform.com/index.html

Rush

(MicrosoftWindows, SgiIrix, RedhatLinux, AppleOsx) Don't know much about this.

http://seriss.com/rush/

Helper Utilities

ftsh
The fault tolerant shell. Looks very cool for writing scripts to run on lots of hosts in a cluster.

http://www.cse.nd.edu/~ccl/software/ftsh/

x-CAT
Extreme Cluster Administration Toolkit by IBM

http://www.x-cat.org/

Other Related Projects

Other Research

Net Booting


CategoryGeek

ClusterSoftwareResearch (last edited 2005-03-08 02:23:59 by AdamShand)