Tutorial: EMPIAR-10499 70S ribosomes#
Here we present tutorials for processing heterogeneous ribosome data from cryo-ET benchmark dataset EMPIAR-10499, as described in our tomoDRGN manuscript. The tutorials cover the following stages of processing:
upstream processing and obtaining input data for tomoDRGN
validating that particles and metadata were extracted correctly:
tomodrgn backproject_voxel, ortomodrgn train_nnwithtomodrgn convergence_nnlearning structural heterogeneity within the dataset:
tomodrgn train_vaewithtomodrgn convergence_vaeanalyzing structural heterogeneity within the dataset:
tomodrgn analyze,tomodrgn eval_volandtomodrgn analyze_volumes, and external tools including SIREn and MAVEnvisualizing structural heterogeneity patterns in the tomogram’s spatial context: tomoDRGN’s interactive visualization jupyter notebook and
tomodrgn subtomo2chimeraxisolating particle subsets of interest: tomoDRGN’s interactive filtering jupyter notebook and
tomodrgn filter_startaking homogeneous particle subsets back into external STA tools for further refinement
With these steps as building blocks, many additional types of analyses are possible.
Note
These tutorials were originally written using data processed using the Warp v1 -> RELION v3 -> M STA pipeline. Therefore, the command syntax used in the tutorials is specific to “Warp v1 style inputs”. Alternative syntax for other STA pipelines, including “WarpTools style inputs”, are given in the command usage section.