Open Data

Folding@home produces molecular-dynamics trajectories at a scale no single lab could afford to generate on its own — built up over years from millions of donors contributing their idle compute. Releasing that data openly is, to us, the obvious response: the simulations were paid for by the community, and they are far more valuable when other researchers can build on them.

Everything on this page is openly accessible. No application, no fee, no signup. Download the trajectories, build new Markov state models, train ML methods on the conformational ensembles, validate experimental hypotheses, or look for the cryptic drug-binding pockets that static structures hide. Please cite the linked papers when you publish.

Markov state models of SARS-CoV-2 proteins

Bowman Lab, Washington University in St. Louis

During the COVID-19 pandemic, Folding@home donors ran simulations targeted at the molecular machinery of SARS-CoV-2: how the spike opens, how viral proteases work, where cryptic drug-binding pockets sit across the proteome. This dataset is the processed MSMs from that work. See the manuscript "SARS-CoV-2 simulations go exascale to capture spike opening and reveal cryptic pockets across the proteome" for details and headline results.

Systems covered: SARS-CoV-2 spike, NSP3, NSP5, NSP7, NSP8, NSP9, NSP10, NSP12, NSP13, NSP14, NSP15, NSP16, nucleoprotein RBD, nucleoprotein dimerization domain; SARS-CoV-1 spike; HCoV-NL63 spike; human ACE2, IL-6, and IL-6R.

Access dataset on OSF

Free-energy landscapes from Markov state models of SARS-CoV-2 proteins, including the spike opening transition
Conformational landscapes from the SARS-CoV-2 MSM dataset. (Bowman Lab, 2020)

SARS-CoV-2 raw simulation data

Bowman Lab, Washington University in St. Louis

The raw trajectories underlying the processed MSMs above. These cover SARS-CoV-2 and the associated host proteins, with an emphasis on discovering cryptic drug-binding pockets. Documentation and access patterns are catalogued at the MolSSI COVID Hub.

Access dataset on AWS Open Data Registry

Myosin motor Markov state models

Bowman Lab, Washington University in St. Louis

Myosin motors carry out an extraordinary range of biological functions despite sharing a common mechanochemical cycle. This dataset models the conformational distributions of twelve myosin motor domains, built from roughly two milliseconds of all-atom, explicit-solvent molecular dynamics — enough trajectory data to resolve how each motor tunes the thermodynamics and kinetics of its cycle.

Published as Porter et al., "Conformational distributions of isolated myosin motor domains encode their mechanochemical properties," eLife 2020.

Access dataset on OSF

Conformational distributions for twelve myosin motor domains, as Markov state model landscapes
Conformational ensembles for the twelve myosin motor domains. (Porter et al., eLife 2020, figure 3)