Software

I develop open-source tools for computational linguistics and data analysis. All projects are available on GitHub.

Python Libraries

dafsa

Deterministic Acyclic Finite State Automaton library for efficient string matching and morphological analysis.

Repository: github.com/tresoldi/dafsa
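
For readers unfamiliar with the data structure, the following is a minimal sketch of the idea behind a DAFSA: build a trie from a word list, then merge states that accept the same set of suffixes. It does not use or reproduce the dafsa library's API; the names Node, build_dafsa, contains, and count_states are hypothetical and chosen only for this illustration.

```python
from typing import Dict, Iterable, Tuple


class Node:
    """One automaton state: outgoing edges plus an accept flag."""

    def __init__(self) -> None:
        self.edges: Dict[str, "Node"] = {}
        self.final = False


def build_dafsa(words: Iterable[str]) -> Node:
    """Build a trie, then merge equivalent states bottom-up."""
    root = Node()
    for word in words:
        node = root
        for ch in word:
            node = node.edges.setdefault(ch, Node())
        node.final = True

    # Maps a state signature to the single canonical state that carries it.
    registry: Dict[Tuple, Node] = {}

    def minimize(node: Node) -> Node:
        # Canonicalize children first so identical suffixes share one object.
        for ch, child in list(node.edges.items()):
            node.edges[ch] = minimize(child)
        # Two states are equivalent when they agree on finality and on their
        # (label, target) pairs; targets are already canonical at this point.
        signature = (
            node.final,
            tuple(sorted((ch, id(t)) for ch, t in node.edges.items())),
        )
        return registry.setdefault(signature, node)

    return minimize(root)


def contains(root: Node, word: str) -> bool:
    """Return True if the automaton accepts the word."""
    node = root
    for ch in word:
        node = node.edges.get(ch)
        if node is None:
            return False
    return node.final


def count_states(root: Node) -> int:
    """Count the distinct states reachable from the root."""
    seen = set()
    stack = [root]
    while stack:
        node = stack.pop()
        if id(node) in seen:
            continue
        seen.add(id(node))
        stack.extend(node.edges.values())
    return len(seen)


automaton = build_dafsa(["tap", "taps", "top", "tops"])
print(contains(automaton, "tops"))  # True
print(contains(automaton, "to"))    # False
print(count_states(automaton))      # 5 states, versus 8 in the plain trie
```

The saving comes from sharing: the branches for "tap" and "top" end in the same suffixes, so after minimization they point to the same states.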

ngesh

Library for generating synthetic phylogenetic data for testing and validation of computational methods.

Repository: github.com/tresoldi/ngesh

freqprob

Tools for estimating probabilities from frequency counts in linguistic datasets.

Repository: github.com/tresoldi/freqprob

asymcat

Statistical analysis of asymmetric categorical data with applications in typological research.

Repository: github.com/tresoldi/asymcat

C Tools

acopost

Part-of-speech tagging system implemented in C for efficient text processing.

Repository: github.com/tresoldi/acopost

Other Projects

Additional software projects and tools will be listed here as they are developed.

Philosophy

My software development follows these principles:

  • Open Source: All tools are freely available
  • Documentation: Comprehensive documentation and examples
  • Testing: Robust test suites for reliability
  • Standards: Following community best practices
  • Reproducibility: Enabling reproducible research workflows

Installation & Usage

Most of the Python libraries listed above can be installed with pip, replacing package-name with the name of the library:

pip install package-name

See individual repositories for specific installation instructions and usage examples.
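
After installing, it can be useful to confirm which packages are present in the current environment. The sketch below uses only the Python standard library (importlib.metadata, Python 3.8 or later). The helper name installed_versions is hypothetical, and it is an assumption that each library is distributed under the same name as its repository.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Dict, Iterable, Optional


def installed_versions(names: Iterable[str]) -> Dict[str, Optional[str]]:
    """Map each distribution name to its installed version, or None if absent."""
    found: Dict[str, Optional[str]] = {}
    for name in names:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            found[name] = None
    return found


# Names taken from the Python Libraries list above; that each one is
# published under exactly this name is an assumption.
packages = ["dafsa", "ngesh", "freqprob", "asymcat"]
for name, ver in installed_versions(packages).items():
    print(f"{name}: {ver or 'not installed'}")
```

A package reported as not installed may simply use a different distribution name, so the repository's own instructions remain the authoritative source.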