Learn practical skills, build real-world projects, and advance your career

Top Repositories for Github Topics

 

Pick a website and describe your objective

  • Browse through different sites and pick on to scrape. Check the "Project Ideas" section for inspiration.
  • Identify the information you'd like to scrape from the site. Decide the format of the output CSV file.
  • Summarize your project idea and outline your strategy in a Juptyer notebook. Use the "New" button above.

Project Outline

  • we're going to scrap https://github.com/topics
  • we'll get a list of topics. For each topic, we'll get topic title, topic page URL and topic description.
  • For each topic , we'll get the top 25 repositories from the topic page.
  • For each repository we'll grab the repo name, username, stars and repo URL.
  • For each topic we'll create a csv file in th following format:
Repo name,Username,Stars,Repo URL
three.js,mrdoob,78900,https://github.com/mrdoob/three.js
libgdx,libgdx,19600,https://github.com/libgdx/libgdx

Use the requests library to download web pages