`Crawler`

Multi-threaded Web crawler with support for custom fetching and persisting logic.

Usage

NOTE: See the crate's documentation for more info.

As a binary

The following command runs the crawler with 10 threads, starting at the URL http://example.com and storing the visited websites as files in the ./crawlings directory.

cargo run --bin crawler http://example.com ./crawlings 10
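The command above passes three positional arguments: the start URL, the output directory, and the thread count. As a rough illustration of how a binary could read them, here is a minimal sketch using only the standard library. The `Config` struct, the `parse_args` function, and the error messages are hypothetical and are not taken from this crate's source.

```rust
use std::path::PathBuf;

/// Hypothetical container for the three positional arguments.
#[derive(Debug, PartialEq)]
struct Config {
    start_url: String,
    out_dir: PathBuf,
    num_threads: usize,
}

/// Parses `<url> <directory> <threads>` from an argument iterator.
/// Assumption: this mirrors the argument order of the command above.
fn parse_args<I: Iterator<Item = String>>(mut args: I) -> Result<Config, String> {
    let start_url = args.next().ok_or("missing start URL")?;
    let out_dir = args.next().ok_or("missing output directory")?;
    let num_threads = args
        .next()
        .ok_or("missing thread count")?
        .parse::<usize>()
        .map_err(|e| format!("invalid thread count: {}", e))?;
    if num_threads == 0 {
        return Err("thread count must be at least 1".to_string());
    }
    Ok(Config {
        start_url,
        out_dir: PathBuf::from(out_dir),
        num_threads,
    })
}

fn main() {
    // Skip the first argument, which is the program name.
    match parse_args(std::env::args().skip(1)) {
        Ok(config) => println!("{:?}", config),
        Err(msg) => eprintln!("error: {}", msg),
    }
}
```

Taking an iterator instead of reading `std::env::args()` directly keeps the parsing logic easy to test in isolation.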

As a library

extern crate crawler;

use crawler::traits::{Fetch, Persist};
use crawler::crawler::Crawler;

// ... trait implementations for `Fetch` and `Persist`

fn main() {
    let url = "http://example.com";
    let num_threads: usize = 2;

    let persister = YourPersister::new();
    let fetcher = YourFetcher::new();

    let mut crawler = Crawler::new(persister, fetcher, num_threads);
    let _result = crawler.start(url);
}
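The example above leaves out the trait implementations. The sketch below shows one way custom fetching and persisting logic could be structured. It is self-contained and defines its own stand-in `Fetch` and `Persist` traits; their method names and signatures are assumptions for illustration, and the real definitions in `crawler::traits` may differ (for instance, a multi-threaded crawler will likely require additional bounds such as `Send`). `StaticFetcher`, `MemoryPersister`, and `crawl_one` are likewise hypothetical names.

```rust
use std::collections::HashMap;

// Hypothetical trait shapes; check the crate's documentation for the real ones.
trait Fetch {
    /// Returns the body for `url`, or `None` if it cannot be fetched.
    fn fetch(&self, url: &str) -> Option<String>;
}

trait Persist {
    /// Stores `content` under `url`.
    fn persist(&mut self, url: &str, content: &str) -> Result<(), String>;
}

/// Serves canned pages from memory instead of the network.
struct StaticFetcher {
    pages: HashMap<String, String>,
}

impl Fetch for StaticFetcher {
    fn fetch(&self, url: &str) -> Option<String> {
        self.pages.get(url).cloned()
    }
}

/// Keeps persisted pages in memory instead of writing files.
#[derive(Default)]
struct MemoryPersister {
    stored: HashMap<String, String>,
}

impl Persist for MemoryPersister {
    fn persist(&mut self, url: &str, content: &str) -> Result<(), String> {
        self.stored.insert(url.to_string(), content.to_string());
        Ok(())
    }
}

/// Fetches a single URL and persists it; returns whether a page was stored.
fn crawl_one<F: Fetch, P: Persist>(
    fetcher: &F,
    persister: &mut P,
    url: &str,
) -> Result<bool, String> {
    match fetcher.fetch(url) {
        Some(body) => {
            persister.persist(url, &body)?;
            Ok(true)
        }
        None => Ok(false),
    }
}

fn main() {
    let mut pages = HashMap::new();
    pages.insert(
        "http://example.com".to_string(),
        "<html>hello</html>".to_string(),
    );
    let fetcher = StaticFetcher { pages };
    let mut persister = MemoryPersister::default();

    let stored = crawl_one(&fetcher, &mut persister, "http://example.com").unwrap();
    println!("stored: {}", stored);
}
```

In-memory implementations like these are mainly useful in tests, since they exercise the crawling logic without network or filesystem access.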
