Skip to content
Back

ETE Toolkit: Phylogenomic Data Analysis and Visualization

Legacy EOSS

2020

Proposal Summary

To support the release and maintenance of a new version of the ETE toolkit including updated documentation and new features such as tree diff, tree-like regular expression searches, and large tree visualization.


ETE Toolkit

ETE is a Python framework that assists in the programmatic reconstruction, analysis and visualization of phylogenetic trees and multiple sequence alignments. This is the most common type of data obtained from the evolutionary analysis of genes and proteins coming from shotgun sequencing projects. ETE provides a comprehensive API to handle phylogenetic results and to integrate them into the scientific Python ecosystem (e.g. Jupyter notebooks). This includes a fully-featured system for programmatic visualization of hierarchical trees that complements other Python libraries such as matplotlib, plotly or seaborn. With more than ten years of continuous development, ETE is considered a standard tool for phylogenomic analysis, with more than 45,000 downloads from conda and over 850 citations in peer-reviewed papers with a clear increasing trend. The ETE team will boost the release of a new version of the ETE library including all novel features and bug fixes accumulated over the past years, update library documentation, tutorial, and cookbooks and also improve user support, unitests, continuous development system and community engagement. Those tasks represent the weakest corners in the sustainability of the project, given that, despite a large user community, ETE is developed by a small group of volunteer researchers without direct funding. The requested resources represent, therefore, the minimal setup to cope with an increasing demand of support, frequent updates, and academic use.

Project Team

Jaime Huerta Cepas Github

Centro de Biotecnología y Genómica de Plantas (CBGP UPM-INIA)

Renato Alves Github

European Molecular Biology Laboratory (EMBL)

Ziqi Deng Github

Centro de Biotecnología y Genómica de Plantas (CBGP UPM-INIA)

Ana Hernández Github

Centro de Biotecnología y Genómica de Plantas (CBGP UPM-INIA)

Francois Serra Github

Barcelona Supercomputing Center (BSC)