A simple python scraper for the gallica.bnf.fr website (output is High Res JPEG)

nazmifr/BNF_Gallica_Scraper

BNF_Gallica_Scraper

Say you have an old e-reader that doesn't support PDF scans and you still want to read old books from the awesome https://gallica.bnf.fr archive. Well, you just use this script.

Please, no abuse! The French government has been nice enough to set this thing up, so let's be thankful and not be di*ks.

🇫🇷🇫🇷🇫🇷 French version of the docs: https://nazmi.fr/gallica_bnf_scraper/

Video tutorial coming soon

How To

Download bnfscrape.py

Put it in an empty folder

Have Python and pip installed, plus the dependency library: wget

pip install wget

Edit the 3 variables at the top of the script

fraum = first page to download

tau = last page to download

part1 = beginning of the image URL, see the example

(To get the URL, right-click one page of the document in the booklet viewer, then click "Open image in new tab" or "View image".)
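
To make the three variables concrete, here is a minimal sketch of the kind of loop they drive. It is not the code in bnfscrape.py. The URL layout (part1 followed by the page number and an optional suffix), the `suffix` and `delay_seconds` parameters, the output file names, and the example.org address are all assumptions made for illustration, so check the example in the script for the real format.

```python
def build_page_urls(part1, fraum, tau, suffix=""):
    """Return one URL per page, from page fraum to page tau inclusive.

    Assumption: a page URL is part1 + page number + suffix.
    """
    if fraum > tau:
        raise ValueError("fraum must not be greater than tau")
    return [f"{part1}{page}{suffix}" for page in range(fraum, tau + 1)]


def download_pages(part1, fraum, tau, suffix="", delay_seconds=2.0):
    """Download each page into the current folder as page_NNNN.jpg."""
    import time

    import wget  # third-party dependency: pip install wget

    urls = build_page_urls(part1, fraum, tau, suffix)
    for page, url in enumerate(urls, start=fraum):
        wget.download(url, out=f"page_{page:04d}.jpg")
        # Hypothetical pause between requests, to go easy on the server.
        time.sleep(delay_seconds)


if __name__ == "__main__":
    # Placeholder address, not a real Gallica URL.
    for url in build_page_urls("https://example.org/document/f", 1, 3):
        print(url)
```

Running the file only prints the three generated URLs. Calling `download_pages` with the same arguments would fetch them one by one.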
