Tag: Python

  • Rust: Python’s New Best Friend – A Data Scientist’s Journey

    Rust: Python’s New Best Friend – A Data Scientist’s Journey

    As Python continues to dominate data science, a quiet revolution is happening underneath the surface. Increasingly, Rust is powering our most critical Python tools—bringing unprecedented performance while maintaining the Python interface we know and love. This hybrid approach transforms our work as data scientists, enabling rapid development and production-grade performance.

    My journey with Rust began six years ago as a distant curiosity. I heard the name in conference talks and saw it climbing GitHub’s language popularity charts, but it remained just another programming language on my “maybe someday” list.

    (more…)
  • Why Probabilistic Programming? A Journey Through the Monty Hall Problem

    Why Probabilistic Programming? A Journey Through the Monty Hall Problem

    Even brilliant minds can be led astray by probability puzzles. When presented with the Monty Hall Problem, renowned mathematician Paul Erdős initially rejected the correct solution – and he wasn’t alone. Thousands of readers, including PhDs in mathematics and statistics, wrote angry letters to Marilyn vos Savant when she published the correct solution in Parade magazine. Their passionate resistance reveals something fascinating about how humans reason about uncertainty.

    To explore these ideas hands-on, we’ve created a Jupyter notebook that implements both traditional and probabilistic programming approaches to the Monty Hall Problem. The notebook includes code for simulating the game, modeling player behavior, and analyzing how people learn from experience.

    (more…)
  • Introducing chronowords: A Python Package for Diachronic Word Embeddings

    Introducing chronowords: A Python Package for Diachronic Word Embeddings

    We’re excited to announce the release of chronowords, a Python package designed to facilitate the analysis of semantic change in text over time. Through our research, we frequently encountered the need for temporal text analysis, which led us to develop this package to make diachronic (time-based) word embedding analysis more accessible.

    (more…)
  • Mapping with Altair and Folium

    Mapping with Altair and Folium

    Although we love QGIS we prefer programmatic solutions when we have to visualize data. The Python ecosystem offers excellent tools to produce interactive and static maps. We tested Altair and Folium on the same dataset.

    (more…)
  • Which country is the best? Punch card version

    Which country is the best? Punch card version

    The most-read post of all time on our blog is Which country is the best? Dealing with ranking data. Last week we used bump charts to visualize the data. This week we used a modified punch card diagram.

    (more…)