Phonometrica
Overview
Phonometrica is a free, open-source software platform for the annotation and analysis of speech corpora. It provides a user-friendly interface for managing, annotating, analyzing, and querying language corpora. It is particularly well suited to time-aligned data and integrates corpus management, acoustic analysis, and statistical modeling into a single workflow. Its main features are:
- Project management: organize files into projects with extensible metadata (properties).
- Sound visualization and analysis: visualize waveforms, spectrograms, pitch tracks, formant tracks, intensity curves, spectral slices, and spectral moments.
- Sound annotation: create and edit multi-layer annotations based on annotation graphs; import and export Praat TextGrids.
- Speech processing: detect silences to pre-segment recordings for manual annotation; run automatic speech recognition via whisper.cpp (runs locally on CPU, requires a model file).
- Text and acoustic queries: search for text patterns across annotation layers using simple or complex multi-constraint queries; extract formant, pitch, intensity, and spectral moment measurements.
- Concordance and dataset views: browse, filter, recode, transform, and merge query results; apply vowel normalization (Lobanov, Nearey, Watt & Fabricius); toggle between wide and long formats; perform set operations on concordances.
- Statistical analysis (preview/beta): fit frequentist and Bayesian linear, logistic, Poisson, negative binomial, beta, and robust Student t regression models, including mixed-effects models and generalized additive models (GAMs); run post-hoc and diagnostic tests; produce exploratory visualizations.
- Scripting engine: configure and extend Phonometrica with an easy-to-use scripting language, JSON-based plugins, and coding protocols.
- Research notes: keep free-form rich-text notes alongside your data, organized within the project.
- Standards-based: Phonometrica files are encoded in XML and Unicode.
- Interaction with Praat: read and write TextGrid files, and open files directly in Praat from the file manager, annotation views, and concordance views.
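As an illustration of the Lobanov method listed among the vowel normalization options above, here is a minimal Python sketch. It is not Phonometrica's own code or scripting language: the function name `lobanov` and the sample values are hypothetical, and the choice of the sample standard deviation is an assumption (implementations differ on this point). The idea is simply to z-score each speaker's measurements of a given formant.

```python
from statistics import mean, stdev


def lobanov(values):
    """Z-score one speaker's measurements of a single formant (in Hz).

    Assumption: the sample standard deviation is used; some
    implementations use the population standard deviation instead.
    """
    m = mean(values)
    s = stdev(values)
    return [(v - m) / s for v in values]


# Hypothetical F1 measurements (Hz) for a single speaker
f1 = [300.0, 500.0, 700.0]
print(lobanov(f1))  # [-1.0, 0.0, 1.0]
```

Because each speaker's values are centered and scaled separately, the normalized values can be compared across speakers with different vocal tract sizes.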
Phonometrica runs on all major platforms (Windows, macOS and GNU/Linux) and is freely available under the terms of the GNU General Public License (version 3). The latest version can be downloaded from http://www.phonometrica-ling.org. The source code is available at https://github.com/jeychenne/phonometrica. If you have questions or problems, or would like to report a bug, please contact us at julien.eychenne@usherbrooke.ca.
Download
Phonometrica 0.9.1
Windows: setup_phonometrica.exe
macOS: Phonometrica-0.9.1.dmg
Linux (Debian/Ubuntu): phonometrica-0.9.1.deb
Source code: phonometrica-0.9.1.zip | phonometrica-0.9.1.tar.gz
Documentation
Phonometrica’s documentation is available on its website. It is also accessible from within the application via the Help menu and the help buttons available in each view.
Topics
- Installation
- Getting started
- Preferences
- Sound visualization and analysis
- Sound annotation
- Speech
- Queries
- Concordances
- Datasets
- Vowel normalization
- Statistical analysis
- Column transformations
- Research notes
- Praat integration
- Keyboard shortcuts
- Scripting
- Plugins
- License
- Acknowledgements
- Release notes
How to cite
To cite Phonometrica, please use the following reference [EYC2025]:
Eychenne, Julien & Léa Courdès-Murphy (2025). Annotation et analyse de données sociophonologiques sur grands corpus : présentation de la plateforme Phonometrica. In Wim Remysen & Hélène Blondeau (eds.) (Re)donner la parole aux corpus montréalais : Regards rétrospectifs et prospectifs. Montreal: Presses Universitaires de Montréal, pp. 255–270.
You may also cite the earlier conference paper [EYC2019]:
Eychenne, Julien & Léa Courdès-Murphy (2019). Phonometrica: an open platform for the analysis of speech corpora. Proceedings of the Seoul International Conference on Speech Sciences 2019, Seoul National University, pp. 107–108.