From: Peter Amstutz Date: Fri, 14 Feb 2020 15:19:00 +0000 (-0500) Subject: 16080: Align descriptive text with new website X-Git-Tag: 2.1.0~299^2~1 X-Git-Url: https://git.arvados.org/arvados.git/commitdiff_plain/3d4de36a24221e499ed944f5472925581d4e276a 16080: Align descriptive text with new website Arvados-DCO-1.1-Signed-off-by: Peter Amstutz --- diff --git a/README.md b/README.md index 6ed29f391c..2f6af250e7 100644 --- a/README.md +++ b/README.md @@ -6,24 +6,51 @@ -[Arvados](https://arvados.org) is a free software distributed computing platform -for bioinformatics, data science, and high throughput analysis of massive data -sets. Arvados supports a variety of cloud, cluster and HPC environments. +[Arvados](https://arvados.org) is an open source platform for +managing, processing, and sharing genomic and other large scientific +and biomedical data. With Arvados, bioinformaticians run and scale +compute-intensive workflows, developers create biomedical +applications, and IT administrators manage large compute and storage +resources. -Arvados consists of: +The key components of Arvados are: -* *Keep*: A petabyte-scale content-addressed distributed storage - system for storing, managing and versioning collections of files. - Like git for big data. Interoperable data access by a variety of - methods including WebDAV, FUSE file system mount, and Arvados APIs. +## Keep -* *Crunch*: A container-based cloud and HPC workflow engine providing - strong versioning, reproducibilty, and provenance of large-scale - computations. Supports [Common Workflow - Language](https://www.commonwl.org) for describing workflows. +Keep is the Arvados storage system for managing and storing large +collections of files. Keep combines content addressing and a +distributed storage architecture resulting in both high reliability +and high throughput. Every file stored in Keep can be accurately +verified every time it is retrieved. Keep supports the creation of +collections as a flexible way to define data sets without having to +re-organize or needlessly copy data. Keep works on a wide range of +underlying filesystems and object stores. -* Related services and components including a web workbench for managing files - and compute jobs, REST APIs, SDKs, and other tools. +## Crunch + +Crunch is the orchestration system for running [Common Workflow Language](https://www.commonwl.org) workflows. It is +designed to maintain data provenance and workflow +reproducibility. Crunch automatically tracks data inputs and outputs +through Keep and executes workflow processes in Docker containers. In +a cloud environment, Crunch optimizes costs by scaling compute on demand. + +## Workbench + +The Workbench web application allows users to interactively access +Arvados functionality. It is especially helpful for querying and +browsing data, visualizing provenance, and tracking the progress of +workflows. + +## Command Line + +The command line interface (CLI) provides convenient access to Arvados +functionality in the Arvados platform from the command line. + +## API and SDKs + +Arvados is designed to be integrated with existing infrastructure. All +the services in Arvados are accessed through a RESTful API. SDKs are +available for Python, Go, R, Perl, Ruby, and Java. # Quick start