Learn practical skills, build real-world projects, and advance your career

Selenium project: Scrape data from online web

In this project we are going to use selenium to scrape Chinese BR number from https://xin.baidu.com/.
Assume we have input excel file with company name in first column, we will return a result excel with BR number and company address if found on the checking website.

#Reference source code: https://zhuanlan.zhihu.com/p/165282881 by 冷颜夕

Prerequisite:

  1. Corresponding Geckodriver must be available, we are using Firefox as our brower.
  2. Microsoft Excel is needed for reading and writing data.

Data Cleansing

The input data should only contain Chinese character with symbol ( or ).

If full-width ( or ) e.g. ( or ) is found, it will be replaced before searching.

How to run the code