Loka Makes Headlines with a New Genomics Workflow

Industries

Biotech, Healthcare,
Medical Devices, Therapeutics,
Drug Discovery

Tech & TOOLS

Cromwell, Batch, FSx for Lustre,
RDS, S3, ECR, Security Groups
and IAM Roles

Teams & Services

Data Engineering, Pipeline
and HPC Orchestration, AWS
and Scientific Workflow Experts

milestones

2022: Featured in Science Advances
2023: AWS Adopts Loka’s Innovation

Industry

AI/ML
IT
Information Services

Tech & TOOLS

Node JS, Python, Azure, AWS, Postgres, Redis, Cloudflare, ML, Kubernetes and more

Teams & Services

Google Cloud Solution Architects, Backend Engineers, FrontEnd Engineers, Designers, Project Manager

milestones

Forbes 50 AI Companies to Watch
2022 Acquired by Discord

Chronic kidney disease affects 780 million people globally—about one in ten people on earth. Goldfinch wants to change that.

The goal

Seeking novel therapeutics for Chronic Kidney Disease (CKD) patients, Goldfinch Bio is combining genetics and technology to target new drugs and improve clinical treatments. Goldfinch partnered with Loka to develop systems that could run genomic structural variation pipelines on AWS. The results of the collaboration made headlines in the scientific community.

The challenge

Although Goldfinch Bio was integrated within AWS, it lacked the necessary data engineering, pipeline management and deep expertise in both cloud technologies and scientific workflow definitions. This deficit hindered its ability to improve and customize the genomics tools essential for effective pipeline operation.

The solution

Loka’s specialized knowledge of HPC, AWS Batch, open-source packages, open data sets and processes like parallelization--which enables thousands of jobs to run simultaneously--delivered best-in-class solutions faster than Goldfinch could’ve achieved on its own.

Loka deployed Cromwell, an open-sourced workflow management system geared toward scientific workflows, on an Amazon Elastic Compute Cloud instance. We launched the AWS infrastructure required to properly run Cromwell, utilizing AWS S3, AWS Batch, AWS RDS, security groups and IAM Roles.

What we delivered

Loka’s team identified and transferred input genomics data from Google Cloud to AWS, modifying the input JSONs to use AWS S3 buckets and patching the WDL files to work in AWS.

Loka then added and enriched Cromwell functionalities that improved the overall integration of Cromwell in AWS, enhancing flexibility and performance for users and companies.

The results

Loka successfully transitioned the GATK-SV pipeline, initially developed by the Broad Institute for Google Cloud, to operate on AWS. This shift led to notable enhancements in performance, speed and cost efficiency. Loka’s collaboration with Goldfinch, which landed on the July 2022 cover of Science Advances, was further celebrated when our team leader and Loka's CEO shared their insights on the AWS Health Innovation Podcast and co-authored an article about the project with Goldfinch and AWS.

>50% performance improvement

Loka decreased the execution time from three-plus days to 1.3 days.

>33% cost reduction

The enhancements implemented not only minimized the effort required but also led to a drastic reduction in the total cost of utilized EC2 instances.

AWS adopted Loka’s innovation

AWS-platformed companies can now run their pipelines markedly faster using the infrastructure Loka developed. (Find deployment instructions on GitHub.)

Goldfinch & Loka’s work grace the cover of Science Advances

Together, our hard work developing kidney disease models made headlines. We were also featured on the AWS Health Innovation Podcast and AWS Health Blog.

“Loka jumped in as an extension of our team, supporting our DevOps work and collaborating on the GATK-SV project. The team started contributing right away. I always felt like Loka’s people were on point and focused on the priorities.”

Adam Tebbe - VP Computational Data Science & Technology
Adam Tebbe
VP Computational Data Science & Technology