Menu
Home Explore People Places Arts History Plants & Animals Science Life & Culture Technology
On this page
Bioconductor
Software project for the analysis of genomic data

Bioconductor is a free, open source and open development software project for the analysis and comprehension of genomic data generated by wet lab experiments in molecular biology.

Bioconductor is based primarily on the statistical R programming language, but does contain contributions in other programming languages. It has two releases each year that follow the semiannual releases of R. At any one time there is a release version, which corresponds to the released version of R, and a development version, which corresponds to the development version of R. Most users will find the release version appropriate for their needs. In addition there are many genome annotation packages available that are mainly, but not solely, oriented towards different types of microarrays.

The project was started in the Fall of 2001 and is overseen by the Bioconductor core team, based primarily at the Fred Hutchinson Cancer Research Center, with other members coming from international institutions.

We don't have any images related to Bioconductor yet.
We don't have any YouTube videos related to Bioconductor yet.
We don't have any PDF documents related to Bioconductor yet.
We don't have any Books related to Bioconductor yet.
We don't have any archived web articles related to Bioconductor yet.

Packages

Most Bioconductor components are distributed as R packages, which are add-on modules for R. Initially most of the Bioconductor software packages focused on the analysis of single channel Affymetrix and two or more channel cDNA/Oligo microarrays. As the project has matured, the functional scope of the software packages broadened to include the analysis of all types of genomic data, such as SAGE, sequence, or SNP data.

Goals

The broad goals of the projects are to:

Main features

  • Documentation and reproducible research. Each Bioconductor package contains at least one vignette, which is a document that provides a textual, task-oriented description of the package's functionality. These vignettes come in several forms. Many are simple "How-to"s that are designed to demonstrate how a particular task can be accomplished with that package's software. Others provide a more thorough overview of the package or might even discuss general issues related to the package. In the future, the Bioconductor project is looking towards providing vignettes that are not specifically tied to a package, but rather are demonstrating more complex concepts. As with all aspects of the Bioconductor project, users are encouraged to participate in this effort.
  • Statistical and graphical methods. The Bioconductor project aims to provide access to a wide range of powerful statistical and graphical methods for the analysis of genomic data. Analysis packages are available for: pre-processing Affymetrix and Illumina, cDNA array data; identifying differentially expressed genes; graph theoretical analyses; plotting genomic data. In addition, the R package system itself provides implementations for a broad range of state-of-the-art statistical and graphical techniques, including linear and non-linear modeling, cluster analysis, prediction, resampling, survival analysis, and time series analysis.
  • Genome annotation. The Bioconductor project provides software for associating microarray and other genomic data in real time to biological metadata from web databases such as GenBank, LocusLink and PubMed (annotate package). Functions are also provided for incorporating the results of statistical analysis in HTML reports with links to annotation WWW resources. Software tools are available for assembling and processing genomic annotation data, from databases such as GenBank, the Gene Ontology Consortium, LocusLink, UniGene, the UCSC Human Genome Project and others with the AnnotationDbi package. Data packages are distributed to provide mappings between different probe identifiers (e.g. Affy IDs, LocusLink, PubMed). Customized annotation libraries can also be assembled.This project also contain several functions for genomic analysis and phylogenetic (e.g. ggtree, phytools packages ..).
  • Open source. The Bioconductor project has a commitment to full open source discipline, with distribution via a SourceForge.net-like platform. All contributions are expected to exist under an open source license such as Artistic 2.0, GPL2, or BSD. There are many different reasons why open-source software is beneficial to the analysis of microarray data and to computational biology in general. The reasons include:
    • To provide full access to algorithms and their implementation
    • To facilitate software improvements through bug fixing and plug-ins
    • To encourage good scientific computing and statistical practice by providing appropriate tools and instruction
    • To provide a workbench of tools that allow researchers to explore and expand the methods used to analyze biological data
    • To ensure that the international scientific community is the owner of the software tools needed to carry out research
    • To lead and encourage commercial support and development of those tools that are successful
    • To promote reproducible research by providing open and accessible tools with which to carry out that research (reproducible research is distinct from independent verification)
  • Open development. Users are encouraged to become developers, either by contributing Bioconductor compliant packages or documentation. Additionally Bioconductor provides a mechanism for linking together different groups with common goals to foster collaboration on software, possibly at the level of shared development.

Milestones

Each release of Bioconductor is developed to work best with a chosen version of R.1 In addition to bugfixes and updates, a new release typically adds packages. The table below maps a Bioconductor release to a R version and shows the number of available Bioconductor software packages for that release.

VersionRelease datePackage countR dependency
3.2030 Oct 20242289R 4.4
3.191 May 20242300R 4.4
3.1825 Oct 20232266R 4.3
3.162 Nov 20222183R 4.2
3.1427 Oct 20212083R 4.1
3.1128 Apr 20201903R 4.0
3.1030 Oct 20191823R 3.6
3.831 Oct 20181649R 3.5
3.631 Oct 20171473R 3.4
3.418 Oct 20161296R 3.3
3.214 Oct 20151104R 3.2
3.014 Oct 2014934R 3.1
2.1315 Oct 2013749R 3.0
2.113 Oct 2012610R 2.15
2.91 Nov 2011517R 2.14
2.814 Apr 2011466R 2.13
2.718 Nov 2010418R 2.12
2.623 Apr 2010389R 2.11
2.528 Oct 2009352R 2.10
2.421 Apr 2009320R 2.9
2.322 Oct 2008294R 2.8
2.21 May 2008260R 2.7
2.18 Oct 2007233R 2.6
2.026 Apr 2007214R 2.5
1.94 Oct 2006188R 2.4
1.827 Apr 2006172R 2.3
1.714 Oct 2005141R 2.2
1.618 May 2005123R 2.1
1.525 Oct 2004100R 2.0
1.417 May 200481R 1.9
1.330 Oct 200349R 1.8
1.229 May 200330R 1.7
1.119 Oct 200220R 1.6
1.01 May 200215R 1.5

Resources

See also

  • Free and open-source software portal
  • Biology portal

References

  1. "Bioconductor – Release Announcements". bioconductor.org. Bioconductor. Retrieved 28 May 2019. https://bioconductor.org/about/release-announcements/