下载PDF
ACCELERATE: ACADEMIC RESEARCH - Researching the Genetic Basis of Behavior, Cognition and Aff ect, USC Needed a High Performance, Scalable Infrastructure to Support Next-Gen Genomics Sequencing
技术
- 分析与建模 - 大数据分析
- 基础设施即服务 (IaaS) - 云存储服务
适用行业
- 医疗保健和医院
- 生命科学
适用功能
- 产品研发
用例
- 预测性维护
- 过程控制与优化
服务
- 云规划/设计/实施服务
- 系统集成
挑战
南加州大学凯克医学院 Zilkha 神经遗传研究所的 James Knowles 博士实验室面临着巨大的挑战。该实验室专注于了解行为、认知和情感的遗传基础,但其传统的 SAN 存储服务器已接近容量上限,无法满足数据访问要求,这令其苦不堪言。网络和 NFS 的性能限制阻碍了存储吞吐量。上传速度缓慢导致的存储瓶颈延迟了发现时间。该实验室需要一种新的存储解决方案,该解决方案可以提供超过每秒 GB 的吞吐量,并在单个名称空间中扩展到 PB 级。Knowles 实验室存在数据存储性能问题。他们需要对 1,400 个完整的人类基因组进行测序,以支持他们正在进行的研究。这项工作每天会产生数 TB 的原始数据,需要传输、检查和与人类基因组对齐。他们的传统存储系统只能向 CPU 集群输出足够的数据,以便在 Pegasus MPI 工作流程下运行 Burrows-Wheeler Aligner (BWA) 的单个实例。此外,他们只能以 30-50 MB/秒的速度将数据上传到该系统,远不及 GbE 网络 100MB/秒的峰值理论容量。这个瓶颈不仅带来了不便,而且还减慢了他们的发现时间。
关于客户
Zilkha 神经遗传研究所是 USC 神经科学计划的重要组成部分,旨在促进不同学科研究人员之间的合作。它旨在促进最优秀和最聪明的研究人员之间的互动。研究所的科学家跨越界限,采用其他研究领域的方法和技术,寻找检查神经系统功能的新方法,以便我们更好地了解神经和精神疾病的根本原因。James Knowles 博士的实验室致力于了解行为、认知和情感的遗传基础。目前,该实验室的大部分工作都致力于了解大脑发育的转录程序以及精神分裂症、躁郁症和强迫症的遗传学。该实验室正在利用高通量测序技术寻找在精神疾病病因中起重要作用的遗传因素。利用这些知识,他们旨在改进诊断方法,并可能开发出治疗方法来改善该人群的生活质量。
解决方案
Knowles 实验室和 USC 的 HPCC 团队与 DDN 合作,确定了一种高性能、可扩展且经济高效的解决方案。USC 选择了基于 DDN 的 Storage Fusion Architecture® 的解决方案,在 SFA10K-E 上运行 DDN GRIDScaler 并行文件系统的嵌入式映像。该解决方案似乎满足了双方的需求,同时支持高性能并行文件系统和 NFS。Keck 医学院的其他拥有大量数据的研究小组了解到即将进行的存储部署。他们将资源投入到 GRIDScaler 的购买中,然后将原始系统容量翻倍至 1PB 以上。GRIDScaler 是一种大型高性能存储设备,具有共享架构 - 能够持续为 USC HPCC 集群提供超过 8 亿 IOPS。与使用较小、性能较低的设备相比,这提供了明显的优势,这也需要更多的 HPCC 管理。存储阵列通过多个 10Gb-E 连接到计算机资源。有一个单独的 10Gb-E 连接到头节点,该节点运行 GRIDScaler 客户端软件的映像并充当 NFS 服务器。Knowles 实验室中的缓存服务器提供到 HPCC 头节点的长距离连接,并充当 Knowles 实验室中 Windows/Linux 终端和仪器的数据传输网关。
运营影响
数量效益
相关案例.
Case Study
Hospital Inventory Management
The hospital supply chain team is responsible for ensuring that the right medical supplies are readily available to clinicians when and where needed, and to do so in the most efficient manner possible. However, many of the systems and processes in use at the cancer center for supply chain management were not best suited to support these goals. Barcoding technology, a commonly used method for inventory management of medical supplies, is labor intensive, time consuming, does not provide real-time visibility into inventory levels and can be prone to error. Consequently, the lack of accurate and real-time visibility into inventory levels across multiple supply rooms in multiple hospital facilities creates additional inefficiency in the system causing over-ordering, hoarding, and wasted supplies. Other sources of waste and cost were also identified as candidates for improvement. Existing systems and processes did not provide adequate security for high-cost inventory within the hospital, which was another driver of cost. A lack of visibility into expiration dates for supplies resulted in supplies being wasted due to past expiry dates. Storage of supplies was also a key consideration given the location of the cancer center’s facilities in a dense urban setting, where space is always at a premium. In order to address the challenges outlined above, the hospital sought a solution that would provide real-time inventory information with high levels of accuracy, reduce the level of manual effort required and enable data driven decision making to ensure that the right supplies were readily available to clinicians in the right location at the right time.
Case Study
Gas Pipeline Monitoring System for Hospitals
This system integrator focuses on providing centralized gas pipeline monitoring systems for hospitals. The service they provide makes it possible for hospitals to reduce both maintenance and labor costs. Since hospitals may not have an existing network suitable for this type of system, GPRS communication provides an easy and ready-to-use solution for remote, distributed monitoring systems System Requirements - GPRS communication - Seamless connection with SCADA software - Simple, front-end control capability - Expandable I/O channels - Combine AI, DI, and DO channels
Case Study
Driving Digital Transformations for Vitro Diagnostic Medical Devices
Diagnostic devices play a vital role in helping to improve healthcare delivery. In fact, an estimated 60 percent of the world’s medical decisions are made with support from in vitrodiagnostics (IVD) solutions, such as those provided by Roche Diagnostics, an industry leader. As the demand for medical diagnostic services grows rapidly in hospitals and clinics across China, so does the market for IVD solutions. In addition, the typically high cost of these diagnostic devices means that comprehensive post-sales services are needed. Wanteed to improve three portions of thr IVD:1. Remotely monitor and manage IVD devices as fixed assets.2. Optimizing device availability with predictive maintenance.3. Recommending the best IVD solution for a customer’s needs.
Case Study
HaemoCloud Global Blood Management System
1) Deliver a connected digital product system to protect and increase the differentiated value of Haemonetics blood and plasma solutions. 2) Improve patient outcomes by increasing the efficiency of blood supply flows. 3) Navigate and satisfy a complex web of global regulatory compliance requirements. 4) Reduce costly and labor-intensive maintenance procedures.
Case Study
Harnessing real-time data to give a holistic picture of patient health
Every day, vast quantities of data are collected about patients as they pass through health service organizations—from operational data such as treatment history and medications to physiological data captured by medical devices. The insights hidden within this treasure trove of data can be used to support more personalized treatments, more accurate diagnosis and more advanced preparative care. But since the information is generated faster than most organizations can consume it, unlocking the power of this big data can be a struggle. This type of predictive approach not only improves patient care—it also helps to reduce costs, because in the healthcare industry, prevention is almost always more cost-effective than treatment. However, collecting, analyzing and presenting these data-streams in a way that clinicians can easily understand can pose a significant technical challenge.