GPT4 Vision Web Crawling

This is a GPT4 Vision API and Puppeteer powered tool that can answer questions based on website screenshots. You ask it a question and it will browse to a website and take a screenshot. Then it will use GPT4 Vision API to answer the question based on the screenshot.

JavaScript version

The JavaScript version (vision_crawl.js) is able to not only open a URL directly, but it can also click on links on pages.

$ npm install
$ node vision_crawl.js

Python version

The Python version (vision_crawl.py) is the original version, that only opens one URL at a time directly. The Python version uses JavaScript too, for the Puppeteer part.

$ npm install
$ pip install -r requirements.txt
$ python3 vision_crawl.py

Examples

You can ask stuff like this, for example:

"What is the weather like in California?"
"What are the latest news in the world?"
"What is the current stock price of Tesla?"
"How many subscribers does Unconventional Coding have on YouTube?"

Name		Name	Last commit message	Last commit date
Latest commit History 55 Commits
.gitignore		.gitignore
.nvmrc		.nvmrc
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
requirements.txt		requirements.txt
vision_browse.js		vision_browse.js
websocket_client.js		websocket_client.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GPT4 Vision Web Crawling

JavaScript version

Python version

Examples

About

Releases

Packages

Languages

mrdavtan/AI_Vision_Browser

Folders and files

Latest commit

History

Repository files navigation

GPT4 Vision Web Crawling

JavaScript version

Python version

Examples

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages