Advanced Node scraping application

A Node.js application for advanced scraping of data from websites

Web Scraping

Websites that contain interesting data often do not expose it through an API and instead publish it only as text on their pages. That makes the data hard to download directly, so a technique called scraping is used to retrieve it. Web scraping is the process of extracting data from a specific web page. It involves making an HTTP request to the website's server, downloading the page's HTML, and parsing it to extract the desired data.
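The request, download, and parse steps can be sketched without any dependencies. This is a minimal illustration, not the code used in this repository: the function names `downloadHtml` and `extractHeadings` are hypothetical, the choice of `<h3>` as the target tag is an assumption, and the regular expression is only adequate for simple, well-formed markup (a real HTML parser is the safer choice for anything more complex).

```javascript
// Download the raw HTML of a page.
// Assumes Node 18 or newer, where fetch is available as a global.
async function downloadHtml(url) {
  const response = await fetch(url);
  if (!response.ok) {
    throw new Error(`Request failed with status ${response.status}`);
  }
  return response.text();
}

// Extract the visible text of every <h3> element from an HTML string.
// The tag name is an illustrative assumption; adjust it to the target page.
function extractHeadings(html) {
  const headings = [];
  const pattern = /<h3[^>]*>([\s\S]*?)<\/h3>/gi;
  for (const match of html.matchAll(pattern)) {
    // Drop any nested tags (for example a link around the title) and trim.
    const text = match[1].replace(/<[^>]+>/g, '').trim();
    if (text) {
      headings.push(text);
    }
  }
  return headings;
}
```

Keeping the download and the parsing in separate functions means the parsing step can be tried on a saved HTML string without touching the network.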

How the application works

This application demonstrates downloading book data from an online store that does not expose it through an API. It is written in Node.js and uses the popular Selenium browser-automation library. The procedure loads the original page into the browser, then parses the content of the HTML tags and extracts the data. Because the data is often spread across many subpages, the application loads the next page after reading the current one and can save all of the collected data.
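The page-by-page procedure described above might look roughly like the sketch below. It is not the code from this repository: the CSS selectors (`article.product_pod`, `h3 a`, `.price_color`, `li.next a`), the start URL, the output file name `books.json`, and the function names are all hypothetical placeholders that would need to match the real store. It assumes the `selenium-webdriver` package and a Chrome driver are installed.

```javascript
// Read every listing page, following the "next" link until none is left.
// `driver` is a selenium-webdriver WebDriver; locators are passed as
// plain { css: ... } objects, which selenium-webdriver accepts.
async function scrapeAllPages(driver, startUrl, maxPages = 50) {
  const books = [];
  let url = startUrl;
  for (let page = 0; page < maxPages && url; page++) {
    await driver.get(url);

    // Each book is assumed to sit in its own container element.
    const cards = await driver.findElements({ css: 'article.product_pod' });
    for (const card of cards) {
      const link = await card.findElement({ css: 'h3 a' });
      const priceElement = await card.findElement({ css: '.price_color' });
      books.push({
        title: await link.getAttribute('title'),
        price: await priceElement.getText(),
      });
    }

    // Stop when the page has no "next" link.
    const nextLinks = await driver.findElements({ css: 'li.next a' });
    url = nextLinks.length > 0 ? await nextLinks[0].getAttribute('href') : null;
  }
  return books;
}

// Example entry point (not invoked automatically): opens Chrome,
// scrapes all pages, and saves the result as JSON.
async function main() {
  const { Builder } = require('selenium-webdriver');
  const fs = require('node:fs/promises');
  const driver = await new Builder().forBrowser('chrome').build();
  try {
    const books = await scrapeAllPages(driver, 'https://example.com/books');
    await fs.writeFile('books.json', JSON.stringify(books, null, 2));
  } finally {
    await driver.quit();
  }
}
```

Calling `main()` runs the whole procedure. The `maxPages` limit is a safety guard so that a page whose "next" link never disappears cannot loop forever.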


janluksoft/Node_Web_Scraping_adv
