@arcalex

Web Archiving at BA

bibalex.org - Web Archiving

Popular repositories

  1. warcrefs Public

    Web archive deduplication tools

    Java 6 1

  2. warcsum Public

    Web archive checksum

    C 4

  3. linkgate Public

    Common material for IIPC Project LinkGate, including research use cases for web archive graph visualization

    Shell 4 1

  4. waget Public

    Incrementally fetch web archive data files and run actions

    Shell 2 1

  5. racktk Public

    Command-line tools for computer clusters

    Perl 1 1

  6. purge-old-kernels Public

    Purge old kernel packages on a Debian system

    Shell 1

Repositories

Showing 10 of 24 repositories
  • moodwarc Public

    Analysis of how content on the web affects mood over time

    0 GPL-3.0 0 0 0 Updated Jul 30, 2024
  • link-indexer Public

    Linked data collection tool for web archive graph visualization (LinkGate)

    Python 0 GPL-3.0 2 6 1 Updated Feb 22, 2024
  • warclassifier Public

    Machine learning-based content classification for web archives

    0 GPL-3.0 1 0 0 Updated Oct 5, 2023
  • warchtml Public

    Extract HTML data from WARC files

    0 GPL-3.0 0 0 0 Updated Aug 25, 2023
  • meshwarc Public

    Exploring the semantic-based web archive graph using MeshWARC

    0 GPL-3.0 0 0 0 Updated Aug 25, 2023
  • txtcrawl-rs Public

    Crawl the web for text (Rust implementation)

    0 GPL-3.0 1 0 0 Updated Aug 25, 2023
  • waget Public

    Incrementally fetch web archive data files and run actions

    Shell 2 GPL-3.0 1 0 0 Updated Apr 18, 2023
  • link-serv Public

    Versioned graph data service for web archive graph visualization (LinkGate)

    Java 0 GPL-3.0 2 13 8 Updated Dec 10, 2022
  • iipc-collections Public

    Republishing IIPC collections through alternative interfaces for researcher access

    Shell 0 GPL-3.0 0 0 0 Updated Sep 8, 2022
  • txtcrawl Public

    Crawl the web for text

    Python 0 GPL-3.0 1 0 0 Updated Aug 25, 2022
