Clemson University's Adoption of PBS Professional for Enhanced HPC Workload Management
Technology Category
- Application Infrastructure & Middleware - Data Visualization
- Networks & Connectivity - Ethernet
Applicable Industries
- Cement
- Education
Applicable Functions
- Procurement
- Product Research & Development
Use Cases
- Inventory Management
- Smart Campus
Services
- System Integration
- Training
The Challenge
Clemson Computing and Information Technology (CCIT), Clemson University's IT department, was struggling to manage workloads for a rapidly growing user base. CCIT ran the Palmetto cluster, a 17,032-core, 262 TFLOPS HPC system, as the university's primary HPC resource. The system was heavily used by the university's faculty, staff, and students, as well as by 144 external users, including researchers and faculty from other universities. The cluster operated on a 'condo model', in which users could purchase nodes for their own priority usage. However, the open-source Maui scheduler that CCIT had been using could not meet the scalability and reliability needs of this expanding user base: it crashed frequently, and some of its advanced features did not function properly.
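The 'condo model' described above can be illustrated with a small sketch. This is a minimal, hypothetical example in Python, not the algorithm used by Maui or PBS Professional: the names `Job`, `Node`, and `place`, and the three-pass policy itself, are assumptions made for illustration. It assumes that an owner's job prefers an idle node the owner purchased, that any job may borrow any idle node, and that an owner may displace a guest job running on a node the owner purchased.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Job:
    name: str
    user: str


@dataclass
class Node:
    node_id: str
    owner: Optional[str] = None   # None means a general-pool node
    job: Optional[Job] = None     # None means the node is idle


def place(job: Job, nodes: List[Node]) -> Tuple[Optional[str], Optional[Job]]:
    """Try to place `job`; return (node_id or None, evicted guest job or None)."""
    # Pass 1: an idle node purchased by the submitting user.
    # Pass 2: any idle node (idle condo nodes may be borrowed by guests).
    for accept in (lambda n: n.owner == job.user, lambda n: True):
        for node in nodes:
            if node.job is None and accept(node):
                node.job = job
                return node.node_id, None
    # Pass 3: the owner reclaims a purchased node from a guest job.
    for node in nodes:
        if node.owner == job.user and node.job is not None and node.job.user != job.user:
            evicted, node.job = node.job, job
            return node.node_id, evicted
    # Nothing available: the job would stay queued.
    return None, None
```

For example, with one node owned by a hypothetical user `alice` and one general-pool node, two guest jobs fill both nodes; a later job from `alice` then reclaims her node and the displaced guest job is returned to the caller for requeueing. A production scheduler would also weigh job size, walltime, and fairness, all of which this sketch ignores.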
About The Customer
Clemson University is a major land-grant, science- and engineering-oriented research university that ranks in the top 25 among national public universities. The university is committed to teaching and student success, fostering an inclusive, student-centered community characterized by high academic standards, a culture of collaboration, school spirit, and a competitive drive to excel. The university's IT department, Clemson Computing and Information Technology (CCIT), provides cyberinfrastructure resources and advanced research computing capabilities. CCIT supports an array of advanced computing infrastructure made possible through the integration of high-performance computing (HPC), high-performance networks, data visualization, storage architectures, and middleware.
The Solution
To address these challenges, CCIT decided to adopt a commercial-grade workload management solution. After evaluating several vendors, the department chose Altair’s PBS Professional® for its scalability and technical support. PBS Professional met the university's HPC needs, providing the reliability and scalability that the previous open-source scheduler could not deliver. Altair's technical team helped CCIT understand the advanced features of PBS Professional before purchase and provided hands-on training before installation. Cost was also a crucial factor in the decision: Altair offered academic pricing that fit within CCIT's budget. The implementation of PBS Professional began in September 2011, supporting 1,623 nodes. Today, the node count has increased to 1,804, and PBS Professional can scale to support additional nodes as the user base continues to grow.