TY - JOUR T1 - Phyletic Profiling with Cliques of Orthologs Is Enhanced by Signatures of Paralogy Relationships A1 - Škunca, Nives A1 - Bošnjak, Matko A1 - Kriško, Anita A1 - Panov, Panče A1 - Džeroski, Sašo A1 - Šmuc, Tomislav A1 - Supek, Fran Y1 - 2013/01/03 N2 - Author Summary While both the number and the diversity of sequenced prokaryotic genomes grow rapidly, the number of specific assignments of gene functions in the databases remains low and skewed toward the model prokaryote Escherichia coli. To aid in understanding the full set of newly sequenced genes, we created a computational model for assignment of function to prokaryotic genomes. The result is an innovative framework for orthology and paralogy-aware phyletic profiling that provides a large number of computational annotations with high predictive accuracy in train/test evaluations. Our predictions include annotations for 1.3 million genes with the estimated Precision of 90%; these, and many more predictions for 998 prokaryotic genomes are freely available at http://gorbi.irb.hr/. More importantly, we show a proof of principle that our functional annotation model can be used to generate new biological hypotheses: we performed experiments on 38 E. coli knockout mutants and showed that our annotation model provides realistic estimates of predictive accuracy. With this, our work will contribute to making experimental validation of computational predictions more approachable, both in cost and time. JF - PLOS Computational Biology JA - PLOS Computational Biology VL - 9 IS - 1 UR - https://doi.org/10.1371/journal.pcbi.1002852 SP - e1002852 EP - PB - Public Library of Science M3 - doi:10.1371/journal.pcbi.1002852 ER -