Skip to content

sararin/Discord-Scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Discord Scraper [Beta]

Release Date: October 3, 2017 @ 10:13 UTC-0

Tutorial

Step 1: Open/Login to your Discord app and press CTRL + SHIFT + I to open the Developer Tools panel. Take the Server ID and Channel ID data from the URL in the title of the Developer Tools panel. x74PJvp.png

Step 2: Go to the Application tab in the Developer Tools panel. Expand the Local Storage area on the left-side of the Developer Tools panel. Select the discordapp URL in the list. Highlight the data between the quotation marks in the token value and press CTRL + C to copy the token. hTsoSGd.png

Step 3: Open the Discord.py script in any Python IDE of your choosing (IDLE was used in this example). Replace the text '' with the token value you grabbed from Step 2. Replace the text '' with the Server ID data that you grabbed from Step 1. Replace the text '' with the Channel ID data that you grabbed from Step 1. Alter the Query value as necessary (the default is set to grab all NSFW and non-NSFW images and videos). dqsYX8q.png

Step 4: Make sure that you put the Discord.py script in its own folder since all contents will be placed in the same folder that the Discord.py file resides. Run the Discord.py script and it should prompt you for the number of pages to grab (this coincides with the page number from the search query) the general rule is 25 contents per page, so 2 pages would equate to about 50 items. Let the script run (it may take a while to finish) the beta version of this script is designed to halt upon error, but the final release of this script will ignore errors and continue grabbing data as demanded.

Warning

Do not run these scripts if you meet any of these limitations:

  • A CPU that is prone to overheating
  • A metered internet connection (monthly bandwidth caps)
  • A storage device that is low on available space

Changelog (DD-MM-YYYY)

07-04-2018 - Beta Fix #3:

  • Fixed threading issue (too many concurrent threads)
  • Fixed filename issues when grabbing files with similar filenames (still a potential issue with large amounts of files but significantly less issues)

07-04-2018 - Beta Fix #2:

  • Fixed problems when downloading from channels with less than 25 images/videos as the older scripts assumed more than 25 images/videos in the channel.
  • I will incorporate a better method of grabbing images where there's less corruptions and less missing photos.

21-02-2018 - Beta Update #1:

  • Updated this readme to include warning information
  • Created a version for those running Python 3
  • Updated the Python 2 version to match the Python 3 version with threading support

03-10-2017 - Beta Fix #1:

  • Fixed issue with URL appending offset query information ad infinitum.
  • Fixed issue with uninitialized opener data when grabbing multiple pages of JSON data.
  • Added new function to allow for the resetting of opener data when grabbing JSON data.

03-10-2017 - Beta Release:

  • The first release of the script.
  • Not meant for production use.
  • Still has bugs to fix and features to implement.

About

A Python scraper for Discord

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages