Last updated: 26 August 2025
Authors: Canan Karakoç, William R. Shoemaker, Jay T. Lennon
This repository contains the complete analysis pipeline of the manuscript "Evolutionary bioenergetics of sporulation". The project integrates genomic, proteomic, transcriptomic, and by-the-number datasets to quantify ATP investment across the spore lifecycle, compares with other cellular investments. It also incorporates these estimates into mechanistic models to show how metabolic constraints shape sporulation efficiency, and how this developmental program becomes visible to selection.
- bioaccounting/: Bioenergetic calculations using by-the-numbers, spore formation and revival data.
- efficiency/: Data and plots related to sporulation efficiency across studies. Models for population dynamics in batch and chemostat conditions and sporulation efficiency.
- evolution/: Data and figures of sporulation COGs. Model outputs and figures related to evolutionary dynamics.
- All by-the-numbers calculations in the manuscript are embedded in the R code with their references, such as membrane area and membrane costs calculations, genome size, nucleotide costs, and replication costs.
- Key datasets such as amino acid costs, spore-formation, and revival genes/proteins are listed below.
- The project includes models and simulations written in Python, and the codes are provided in their respective folder.
To reproduce the analysis:
- Run
code\cost_of_spore.Rfrom the top for bioaccounting. Ensure the working directory is set properly. - Run Python codes to reproduce data for population dynamics, spore efficiency, and evolutionary analysis. Codes are stored in "efficiency" and "evolution" sections under "model" folders.
- Run
efficiency_plots.RandCOGs_figures.Rto reproduce figures of empirical and model data of population dynamics, efficiency, and evolutionary outcomes. These codes are stored under respective sections and "empirical" folders.
| Dataset Description | Source |
|---|---|
| Gene & protein annotation | SubtiWiki |
| Expression data | SporeWeb |
| Protein abundance | PAX-db |
| Newly synthesized proteins during germination | Swarge et al. 2020 |
| Protein sequences | UniProt: B. subtilis 168 |
| Amino acid/nucleotide ATP costs | Mahmoudabadi et al. 2017 |
| Cost Type | Description |
|---|---|
| Opportunity Cost | ATP-equivalent energy required to synthesize building blocks. |
| Direct Cost | ATPs used in polymerization and structural assembly. |
| Total Cost | Sum of opportunity + direct costs. |
Spore-related COGs are derived from Galperin et al. 2022. See manuscript, supplementary information for empirical efficiency data derived from various sources.
- Figure 1: Cumulative ATP costs of sporulation over time. (bioaccounting/)
- Figure 2: Energetic costs of germination and outgrowth. (bioaccounting/)
- Figure 3: Comparison with alternative stress responses and cellular processes. (bioaccounting/)
- Figure 4: Empirical and theoretical models of sporulation efficiency. (efficiency/)
- Figure 5: COG distributions and evolutionary constraints. (evolution/)
See embedded DOIs in script comments. Full reference list available in the manuscript.
- All scripts are annotated and reproducible within the RStudio project.
- Final manuscript figures formatted in Adobe Illustrator.
Jay T. Lennon — lennonj@iu.edu