Skip to content

lyskov/rosetta

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Rosetta Biomolecular Modeling Library

The Rosetta software suite includes algorithms for computational modeling and analysis of protein structures. It has enabled notable scientific advances in computational biology, including de novo protein design, enzyme design, ligand docking, and structure prediction of biological macromolecules and macromolecular complexes.

Rosetta is maintained by the RosettaCommons, a collaboration of 50+ academic research groups, who have been developing Rosetta for over 20 years. See https://www.rosettacommons.org for more information about Rosetta and the RosettaCommons.

Rosetta Code

While the Rosetta source code is published on GitHub, it is not "Open Source" (according to the OSI definition). Most notably, use for commercial purposes requires purchase of a separate license. See LICENSE.md for further information.

The main GitHub repository on https://github.com/RosettaCommons/rosetta integrates all the Rosetta-associated code base. It should be noted that many parts of Rosetta are structured as separate GitHub repositories, which the main repository conveniently presents as submodules. A significant number are external software packages that Rosetta is redistributing, which helps synchronize development to exact versions of those external softwares across software distributions and thus support users and developers in their communication.

It is not required to manually retrieve those subprojects. Instead, the scons build management will auto-retrieve only those external repositories that are required for the build. The retrieval of external source trees is independent from the software that may already be installed via the Operating System, which is intentional.

git clone https://github.com/RosettaCommons/rosetta

and change to that directory

cd rosetta

Getting Started Using Rosetta

Start here: https://www.rosettacommons.org/docs/latest/getting_started/Getting-Started

The fastest way to address your scientific modeling challenge at hand may be with the official Docker images. Accessible from the official Docker hub at https://hub.docker.com/r/rosettacommons/rosetta, the images have both Rosetta and PyRosetta pre-installed, such that Rosetta tutorials can be followed. You may also conveniently inspect the Dockerfiles (see docker/README.md) as a templated for the Rosetta installation on your local system.

Questions about how to use Rosetta are best directed to the RosettaCommons forums https://www.rosettacommons.org/forum

Installing using Conda

Rosetta binaries are avaliable as a rosetta Conda package in the RosettaCommons Conda Channel. All binaries are built using serialization and cxx11thread extras. Currently RosettaCommons has two mirrors of this channel. To use them please edit ~/.condarc and add snippets for either East or West mirrors:

Example ~/.condarc for US WEST coast (if unsure use this mirror):

channels: 
- https://conda.rosettacommons.org
- conda-forge

Example ~/.condarc for US EAST coast:

channels: 
- https://conda.graylab.jhu.edu
- conda-forge

Compiling Rosetta

To use Rosetta without Docker, or to modify and extend Rosetta yourself, you can compile the Rosetta source tree yourself. (See also https://www.rosettacommons.org/docs/latest/build_documentation/Build-Documentation for details.)

The Rosetta source tree ships with all its run-time dependencies, just when building you need to install the C++ compiler (g++ or clang). Also you need the scripting language Python to be installed, The compilation is then performed by:

$ cd source
$ ./scons.py -j<NumOfJobs> mode=release bin

The Rosetta source tree is big and uses a series of advanced features of the C++ language. While we endeavor to support most compilers where possible, some C++ compilers may not yet perfectly master these features. Later versions of those compilers may have that fixed. Please let us know if you run into issues.

Docker

Official Rosetta/PyRosetta images could be found at https://hub.docker.com/r/rosettacommons/rosetta. Both serial and mpi Rosetta builds provided as well as the number of PyRosetta builds including fully functional Jupyter setups with PyRosetta pre-installed and experimenta builds with libtorch and tensorflow integration. Please see https://hub.docker.com/r/rosettacommons/rosetta for more information.

Various reference Docker files could be found in rosetta/docker dir.

PyRosetta

PyRosetta are Python bindings to the Rosetta library. These can be built from the Rosetta source code.

See https://www.pyrosetta.org for more information about PyRosetta.

The Docker image referenced above already ships with PyRosetta. To prepare the PyRosetta Python module locally from this source tree, you need

  • a C++ compiler, like the one you used to compile the other parts of Rosetta
  • the Ninja (conda, Debian) build management tool
  • and also CMake (conda, Debian) which should all be readily available from your regular Linux distribution.
$ cd source/src/python/PyRosetta
$ python3 build.py -j24 --create-package $HOME/my_pyrosetta_package
$ cd $HOME/my_pyrosetta_package/setup
$ python3 setup.py install

Developing Rosetta

We welcome contributions to improve Rosetta. We use a fork-and-PR system for contribution. To contribute to Rosetta, please fork the Rosetta repo(s) under your own Github user space. You can then develop your additions in your own space. Once you're ready to contribute it back, open a PR agaist the main Rosetta repos. You will need to sign the Rosetta Contributor License Agreement before your contribution can be accepted.

See CONTRIBUTING.md for more details.

Rosetta Code Organization

Due to its size, Rosetta uses git submodules to help in organization.

The main repository (RosettaCommons/rosetta) contains the Rosetta source code, database, unit test and integration tests

  • rosetta/source/src -- The Rosetta source
  • rosetta/database/ -- The Rosetta database (used during runtime)
  • rosetta/source/test/ -- The compiled unit tests
  • rosetta/tests/integration/ -- The integration tests
  • rosetta/source/bin/ -- The location of the (symlinks to) the Rosetta executables -- (created during compilation)
  • rosetta/source/build/ -- The location of the built libraries -- (created during compilation)

Additional information is located in submodules:

The default clone does not pull down submodules, leaving an empty directory. The compilation script will automatically clone the submodules needed for compilation, but not others. To obtain the contents of submodules which aren't currently cloned:

git submodule update --init -- ./submodule_directory_name
# e.g. git submodule update --init -- ./tools/

or if you want to get all the submodules

git submodule update --init --recursive

About

The Rosetta Bio-macromolecule modeling package.

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C++ 69.9%
  • Roff 16.8%
  • MATLAB 5.6%
  • Python 4.6%
  • Ruby 1.2%
  • R 0.8%
  • Other 1.1%