Credits

teiphy was designed by Joey McCollum (Australian Catholic University) and Robert Turnbull (University of Melbourne). We received additional help from Stephen C. Carlson (Australian Catholic University).

If you use this software, please cite the paper: Joey McCollum and Robert Turnbull, “teiphy: A Python Package for Converting TEI XML Collations to NEXUS and Other Formats,” JOSS 7.80 (2022): 4879, DOI: 10.21105/joss.04879.

@article{MT2022,
    author = {Joey McCollum and Robert Turnbull},
    title = {{teiphy: A Python Package for Converting TEI XML Collations to NEXUS and Other Formats}},
    journal = {Journal of Open Source Software},
    year = {2022},
    volume = {7},
    number = {80},
    pages = {4879},
    publisher = {The Open Journal},
    doi = {10.21105/joss.04879},
    url = {https://doi.org/10.21105/joss.04879}
}

Further details on the capabilities of teiphy, particularly in terms of the text-critically valuable features it can map from TEI XML collations to BEAST 2 inputs, are discussed in Joey McCollum and Robert Turnbull, “Using Bayesian Phylogenetics to Infer Manuscript Transmission History,” DSH 39.1 (2024): 258–279, DOI: 10.1093/llc/fqad089.

@article{MT2024,
    author = {Joey McCollum and Robert Turnbull},
    title = {{Using Bayesian Phylogenetics to Infer Manuscript Transmission History}},
    journal = {Digital Scholarship in the Humanities},
    year = {2024},
    volume = {39},
    number = {1},
    pages = {258--279},
    doi = {10.1093/llc/fqad089},
    url = {https://doi.org/10.1093/llc/fqad089}
}

Bibliography

teiphy relies on the following research. Please cite as appropriate:

[AAB+15]

Martín Abadi, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Greg S. Corrado, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Ian Goodfellow, Andrew Harp, Geoffrey Irving, Michael Isard, Yangqing Jia, Rafal Jozefowicz, Lukasz Kaiser, Manjunath Kudlur, Josh Levenberg, Dandelion Mané, Rajat Monga, Sherry Moore, Derek Murray, Chris Olah, Mike Schuster, Jonathon Shlens, Benoit Steiner, Ilya Sutskever, Kunal Talwar, Paul Tucker, Vincent Vanhoucke, Vijay Vasudevan, Fernanda Viégas, Oriol Vinyals, Pete Warden, Martin Wattenberg, Martin Wicke, Yuan Yu, and Xiaoqiang Zheng. TensorFlow: large-scale machine learning on heterogeneous systems. 2015. URL: https://www.tensorflow.org/.

[AAK+14]

Barbara Aland, Kurt Aland, Johannes Karavidopoulos, Carlo M. Martini, and Bruce M. Metzger, editors. The Greek New Testament. Deutsche Bibelgesellschaft, Stuttgart, 5th edition, 2014.

[Bal10]

Clinton S. Baldwin. Factor analysis: a new method for classifying New Testament Greek manuscripts. Andrews University Seminary Studies, 48(1):29–53, 2010.

[BHBR98]

Adrian C. Barbrook, Christopher J. Howe, Norman Blake, and Peter Robinson. The phylogeny of The Canterbury Tales. Nature, 394:839, 1998. doi:10.1038/29667.

[BVBS+19]

Remco Bouckaert, Timothy G. Vaughan, Joëlle Barido-Sottani, Sebastián Duchêne, Mathieu Fourment, Alexandra Gavryushkina, Joseph Heled, Graham Jones, Denise Kühnert, Nicola De Maio, Michael Matschiner, Fábio K. Mendes, Nicola F. Müller, Huw A. Ogilvie, Louis du Plessis, Alex Popinga, Andrew Rambaut, David Rasmussen, Igor Siveroni, Marc A. Suchard, Chieh-Hsi Wu, Dong Xie, Chi Zhang, Tanja Stadler, and Alexei J. Drummond. BEAST 2.5: an advanced software platform for Bayesian evolutionary analysis. PLOS Computational Biology, 15(4):1–28, 2019. doi:10.1371/journal.pcbi.1006650.

[Car15]

Stephen C. Carlson. The Text of Galatians and Its History. Number 385 in Wissenschaftliche Untersuchungen zum Neuen Testament. Mohr Siebeck, Tübingen, 2015. ISBN 978-3-16-153323-5. doi:10.1628/978-3-16-153324-2.

[Edm19]

Andrew Charles Edmondson. An Analysis of the Coherence-Based Genealogical Method Using Phylogenetics. University of Birmingham, PhD diss., 2019. URL: http://etheses.bham.ac.uk/id/eprint/9150.

[Far88]

James S. Farris. Hennig86, ver. 1.5. Program and Documentation. James S. Farris, Port Jefferson Station, NY, 1988.

[Fel04]

Joseph Felsenstein. Inferring Phylogenies. Sinauer Associates, Sunderland, MA, 2004.

[Fin18]

Timothy J. Finney. How to discover textual groups. Digital Studies/Le champ numérique, 2018. doi:10.16995/dscn.291.

[Fis20]

Franz Fischer. Representing the critical text. In Philipp Roelli, editor, Handbook of Stemmatology: History, Methodology, Digital Approaches, De Gruyter Reference, pages 405–427. De Gruyter, Berlin, 2020.

[GC16]

Pablo A. Goloboff and Santiago A. Catalano. TNT, version 1.5, including a full implementation of phylogenetic morphometrics. Cladistics, 32(3):221–238, 2016. doi:10.1111/cla.12160.

[HMvandWalt+20]

Charles R. Harris, K. Jarrod Millman, Stéfan J. van der Walt, Ralf Gommers, Pauli Virtanen, David Cournapeau, Eric Wieser, Julian Taylor, Sebastian Berg, Nathaniel J. Smith, Robert Kern, Matti Picus, Stephan Hoyer, Marten H. van Kerkwijk, Matthew Brett, Allan Haldane, Jaime Fernández del Río, Mark Wiebe, Pearu Peterson, Pierre Gérard-Marchant, Kevin Sheppard, Tyler Reddy, Warren Weckesser, Hameer Abbasi, Christoph Gohlke, and Travis E. Oliphant. Array programming with NumPy. Nature, 585:357–362, 2020. doi:10.1038/s41586-020-2649-2.

[Hyytiainen21]

Pasi Hyytiäinen. The changing text of Acts: a phylogenetic approach. TC: A Journal of Biblical Textual Criticism, 26:1–28, 2021.

[ISM95]

Nancy M. Ide and C.M. Sperberg-McQueen. The TEI: history, goals and future. Computers and the Humanities, 29(1):5–15, 1995. doi:10.1007/978-94-011-0325-1_2.

[Lew01]

Paul O. Lewis. A likelihood approach to estimating phylogeny from discrete morphological character data. Systematic Biology, 50(6):913–925, November 2001. URL: https://doi.org/10.1080/106351501753462876 (visited on 2020-11-08), doi:10.1080/106351501753462876.

[MSM97]

David R. Maddison, David L. Swofford, and Wayne P. Maddison. NEXUS: an extensible file format for systematic information. Systematic Biology, 46(4):590–621, 1997. URL: https://doi.org/10.1093/sysbio/46.4.590, doi:10.1093/sysbio/46.4.590.

[MSC+20]

Bui Quang Minh, Heiko A. Schmidt, Olga Chernomor, Dominik Schrempf, Michael D. Woodhams, Arndt von Haeseler, and Robert Lanfear. IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Molecular Biology and Evolution, 37(5):1530–1534, 2020. URL: https://doi.org/10.1093/molbev/msaa015, doi:10.1093/molbev/msaa015.

[PVG+11]

Fabian Pedregosa, Gaël Varoquaux, Alexandre Gramfort, Vincent Michel, Bertrand Thirion, Olivier Grisel, Mathieu Blondel, Peter Prettenhofer, Ron Weiss, Vincent Dubourg, Jake Vanderplas, Alexandre Passos, David Cournapeau, Matthieu Brucher, Matthieu Perrot, and Édouard Duchesnay. Scikit-learn: machine learning in Python. Journal of Machine Learning Research, 12(85):2825–2830, 2011.

[ROHara92]

Peter Robinson and Robert J. O'Hara. Report on the Textual Criticism Challenge 1991. Bryn Mawr Classical Review, 3(4):331–337, 1992.

[RTvandMark+12]

Fredrik Ronquist, Maxim Teslenko, Paul van der Mark, Daniel L. Ayres, Aaron Darling, Sebastian Höhna, Bret Larget, Liang Liu, Marc A. Suchard, and John P. Huelsenbeck. MRBAYES 3.2: efficient Bayesian phylogenetic inference and model selection across a large model space. Systematic Biology, 61(3):539–542, 2012. doi:10.1093/sysbio/sys029.

[Sal00]

B. J. P. Salemans. Building Stemmas with the Computer in a Cladistic, Neo-Lachmannian, Way: The Case of Fourteen Text Versions of Lanseloet van Denemerken. Katholieke Universiteit Nijmegen, PhD diss., 2000. URL: https://hdl.handle.net/2066/147058.

[SWH02]

Matthew Spencer, Klaus Wachtel, and Christopher J. Howe. The Greek Vorlage of the Syra Harclensis: a comparative study on method in exploring textual genealogy. TC: A Journal of Biblical Textual Criticism, 2002. URL: http://jbtc.org/v07/SWH2002/index.html.

[SWH04]

Matthew Spencer, Klaus Wachtel, and Christopher J. Howe. Representing multiple pathways of textual flow in the Greek manuscripts of the Letter of James using reduced median networks. Computers and the Humanities, 38:1–14, 2004. doi:10.1023/B:CHUM.0000009290.14571.59.

[Sta14]

Alexandros Stamatakis. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics, 30(9):1312–1313, 2014. doi:10.1093/bioinformatics/btu033.

[Swo03]

D. L. Swofford. PAUP*: phylogenetic analysis using parsimony (*and other methods). Version 4. 2003.

[Tho02]

J. C. Thorpe. Multivariate statistical analysis for manuscript classification. TC: A Journal of Biblical Textual Criticism, 2002. URL: http://jbtc.org/v07/Thorpe2002.html.

[Tur20]

Robert Turnbull. The Textual History of Codex Sinaiticus Arabicus and Its Family. Ridley College, PhD diss., 2020.

[Wil08]

Wieland Willker. Principal component analysis of manuscripts of the Gospel of John. 2008. URL: http://www.willker.de/wie/TCG/PCA/index.html.

[ZZ12]

Marinka Zitnik and Blaz Zupan. NIMFA: a Python library for nonnegative matrix factorization. Journal of Machine Learning Research, 13:849–853, 2012.

[McCollum19]

Joey McCollum. Biclustering readings and manuscripts via non-negative matrix factorization, with application to the text of Jude. Andrews University Seminary Studies, 57(1):61–89, 2019.

[McKinney10]

Wes McKinney. Data structures for statistical computing in Python. In Stéfan van der Walt and Jarrod Millman, editors, Proceedings of the 9th Python in Science Conference, 56–61. 2010. doi:10.25080/Majora-92bf1922-00a.

[TEIConsortium22]

TEI Consortium. TEI P5: guidelines for electronic text encoding and interchange: critical apparatus [v.4.4.0]. https://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html, 2022. [Online; accessed 3-September-2022].

[Thepdteam20]

The pandas development team. Pandas-dev/pandas: pandas. 2020. doi:10.5281/zenodo.3509134.