Skip to content
@Living-with-machines

Living with Machines

A radical collaboration between computational linguists, curators, data scientists, software engineers, geographers and historians

Popular repositories Loading

  1. DeezyMatch DeezyMatch Public

    A Flexible Deep Learning Approach to Fuzzy String Matching

    Jupyter Notebook 131 34

  2. MapReader MapReader Public

    A computer vision pipeline for exploring and analyzing images at scale

    Jupyter Notebook 76 10

  3. histLM histLM Public

    Neural Language Models for Historical Research

    Jupyter Notebook 23 21

  4. DiachronicEmb-BigHistData DiachronicEmb-BigHistData Public

    Tools to train and explore diachronic word embeddings from Big Historical Data

    Jupyter Notebook 18 2

  5. nnanno nnanno Public

    nnanno is a collection of tools that sample, annotate and apply computer vision to the Newspaper Navigator dataset

    Jupyter Notebook 17 1

  6. deduplify deduplify Public

    A Python tool to search for and remove duplicated files in messy datasets

    Python 16 2

Repositories

Showing 10 of 52 repositories
  • MapReader Public

    A computer vision pipeline for exploring and analyzing images at scale

    Living-with-machines/MapReader’s past year of commit activity
    Jupyter Notebook 76 10 83 (1 issue needs help) 6 Updated Jun 26, 2024
  • T-Res Public

    A Toponym Resolution Pipeline for Digitised Historical Newspapers

    Living-with-machines/T-Res’s past year of commit activity
    Python 7 1 27 4 Updated Jun 19, 2024
  • dhoxss-text2tech Public

    Materials for the Text to Tech workshop at the Digital Humanities Oxford Summer School

    Living-with-machines/dhoxss-text2tech’s past year of commit activity
    Jupyter Notebook 6 MIT 1 0 2 Updated Jun 18, 2024
  • lwmdb Public

    A django-based library for managing the Living with Machines newspapers metadata database schema

    Living-with-machines/lwmdb’s past year of commit activity
    CSS 2 MIT 0 35 10 Updated Jun 17, 2024
  • deduplify Public

    A Python tool to search for and remove duplicated files in messy datasets

    Living-with-machines/deduplify’s past year of commit activity
    Python 16 MIT 2 2 5 Updated Jun 17, 2024
  • wiki2gaz Public

    A series of scripts to create a gazetteer from a Wikipedia and Wikidata dump

    Living-with-machines/wiki2gaz’s past year of commit activity
    Python 0 MIT 0 1 4 Updated Jun 6, 2024
  • newspapers Public

    Public repository for material relating to open access historical newspapers

    Living-with-machines/newspapers’s past year of commit activity
    Jupyter Notebook 0 0 0 0 Updated Jun 6, 2024
  • DeezyMatch_tutorials Public

    Collection of tutorials for DeezyMatch (https://github.com/Living-with-machines/DeezyMatch)

    Living-with-machines/DeezyMatch_tutorials’s past year of commit activity
    Jupyter Notebook 6 0 1 1 Updated May 3, 2024
  • alto2txt2fixture Public

    Converts metadata from alto2txt into JSON data with corresponding relational IDs for ingestion into a relational database

    Living-with-machines/alto2txt2fixture’s past year of commit activity
    Python 0 MIT 1 11 4 Updated Apr 29, 2024
  • subsamplr Public

    A tool for representative subsampling

    Living-with-machines/subsamplr’s past year of commit activity
    Python 1 MIT 1 0 1 Updated Apr 8, 2024

Top languages

Loading…

Most used topics

Loading…