Stars
An opinionated list of awesome Python frameworks, libraries, software and resources.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.
🍰 Desktop utility to download images/videos/music/text from various websites, and more.
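Python web-scraping tutorial series for learning scraping from zero to one, covering browser packet capture and mobile app packet capture (e.g. fiddler, mitmproxy), use of the modules involved in scraping (e.g. requests, beautifulSoup, selenium, appium, scrapy), as well as IP proxies, CAPTCHA recognition, using MySQL and MongoDB databases from Python, multithreaded and multiprocess crawlers, reverse engineering of CSS-based anti-scraping obfuscation, JS reverse engineering for scraping,…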
Open source UI framework written in Python, running on Windows, Linux, macOS, Android and iOS
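Xiaohongshu note | comment crawler, Douyin video | comment crawler, Kuaishou video | comment crawler, Bilibili video | comment crawler, Weibo post | comment crawler, Baidu Tieba post | Baidu Tieba comment-reply crawler | Zhihu Q&A and article | comment crawler
😮 Python scripts that simulate logging in to some major websites, plus a few simple crawlers. Hope they help ❤️. If you like it, remember to give it a star 🌟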
Asynchronous HTTP client/server framework for asyncio and Python
newspaper3k is a library for news, full-text, and article metadata extraction in Python 3. Advanced docs:
SpiderFoot automates OSINT for threat intelligence and mapping your attack surface.
Python version of the Playwright testing and automation library.
Custom Selenium Chromedriver | Zero-Config | Passes ALL bot mitigation systems (like Distil / Imperva / DataDome / CloudFlare IUAM)
UI Automation Framework for Games and Apps
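A Python-based web automation tool. It can both control the browser and send and receive data packets, combining the convenience of browser automation with the efficiency of requests. Powerful, with countless user-friendly designs and convenient features built in. The syntax is concise and elegant, and little code is needed.
💖 Highly available distributed IP proxy pool, powered by Scrapy and Redis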
Library for building WebSocket servers and clients in Python
Python web-scraping tutorial taking you from zero to one, covering JS reverse engineering, selenium, tesseract OCR recognition, using mongodb, and the scrapy framework
curl-impersonate: A special build of curl that can impersonate Chrome & Firefox
Headless chrome/chromium automation library (unofficial port of puppeteer)
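Easily turn large sets of image URLs into an image dataset. Can download, resize and package 100M URLs in 20h on one machine.
General-purpose extractor for the main text of news web pages, Beta version.
A distributed web crawler built with scrapy, redis, mongodb, and graphite: the underlying storage is a mongodb cluster, distribution is implemented with redis, and crawler status is displayed with graphite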
JA3 is a standard for creating SSL client fingerprints in a way that is easy to produce and share.
Python binding for curl-impersonate via cffi. An HTTP client that can impersonate browser TLS/JA3/HTTP2 fingerprints.