RickC1218/WebCrawler

Web Crawler

📘 Introduction

This is a simple web crawler, written in JavaScript, that scrapes the first 30 entries from https://news.ycombinator.com/.

📝 Requirements

The requirements for this project are:

Important

The program should be able to extract the following information from the website:

  • Title
  • Points
  • Author
  • Number of comments

Note

The program must filter all entries with more than five words in the title and order them by number of comments.

Note

The program must also filter all entries with five or fewer words in the title and order them by points.
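
The two filters can be sketched as below. This is only an illustration, not the project's actual code: the entry shape (`title`, `points`, `author`, `comments`), the function names, and the sample data are all hypothetical. It also makes two assumptions the requirements leave open: a "word" is any whitespace-separated token containing at least one letter or digit, and both orderings are descending.

```javascript
// Hypothetical entry shape: { title, points, author, comments }.

// Count the words in a title. Assumption: a "word" is a whitespace-separated
// token with at least one letter or digit, so a lone "-" is not counted.
function countWords(title) {
  return title
    .split(/\s+/)
    .filter((token) => /[\p{L}\p{N}]/u.test(token)).length;
}

// Entries with more than five words in the title,
// ordered by number of comments (descending is an assumption).
function longTitlesByComments(entries) {
  return entries
    .filter((entry) => countWords(entry.title) > 5)
    .sort((a, b) => b.comments - a.comments);
}

// Entries with five or fewer words in the title,
// ordered by points (descending is an assumption).
function shortTitlesByPoints(entries) {
  return entries
    .filter((entry) => countWords(entry.title) <= 5)
    .sort((a, b) => b.points - a.points);
}

// Made-up sample data, for illustration only.
const sampleEntries = [
  { title: "A fairly long title with seven words", points: 10, author: "a", comments: 3 },
  { title: "Short title", points: 50, author: "b", comments: 8 },
  { title: "Another long title that has eight words total", points: 5, author: "c", comments: 20 },
  { title: "Exactly five words right here", points: 70, author: "d", comments: 1 },
];

console.log(longTitlesByComments(sampleEntries).map((e) => e.title));
console.log(shortTitlesByPoints(sampleEntries).map((e) => e.title));
```

Because `filter` returns a new array, the `sort` calls do not reorder the original list of entries.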

💻 Technologies

  • Node.js v18.7.0 or later
  • JavaScript ES6

▶️ Setup

To run this project, install it locally using npm:

$ cd ../web-crawler
$ npm install
$ npm start https://news.ycombinator.com/

If you want to create an Excel file with the output, you can use the following command:

$ npm start https://news.ycombinator.com fileName
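
As a rough sketch of how the two arguments above might be read, assuming the `start` script forwards them to a Node.js entry point: the `parseArgs` helper and its return shape are hypothetical and are not taken from this repository.

```javascript
// Hypothetical helper: turn the arguments that follow `npm start`
// into a target URL and an optional output file name.
function parseArgs(args) {
  const [url, fileName] = args;
  if (!url) {
    throw new Error("Usage: npm start <url> [fileName]");
  }
  // The global URL constructor throws a TypeError on a malformed URL.
  new URL(url);
  return { url, fileName: fileName ?? null };
}

// In an entry script this would typically be called as:
//   const { url, fileName } = parseArgs(process.argv.slice(2));
```

When `fileName` is `null`, the program would print to the console only. When it is set, the output would also be written to a file with that name.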

🖼️ Images

This is an example of the console output of the program: (image: Example console)

This is an example of the output file of the program: (image: Example file)

🧪 Testing

To run the tests, use the following command:

$ npm test

👨‍💻 Sources

This project was created as part of a test for a Software Development Internship program.
