Updated 3 years ago
Selenium project: Scrape data from the web
In this project we use Selenium to scrape Chinese business registration (BR) numbers from https://xin.baidu.com/.
We assume an input Excel file with company names in the first column. The output is a result Excel file containing the BR number and company address for each company found on the site.
#Reference source code: https://zhuanlan.zhihu.com/p/165282881 by 冷颜夕
Prerequisites:
- A Geckodriver version matching your Firefox installation must be available; we use Firefox as our browser.
- Microsoft Excel is needed for reading and writing data.
Data Cleansing
The input data should contain only Chinese characters and the half-width parentheses ( and ).
If a full-width parenthesis (（ or ）) is found, it is replaced with its half-width equivalent before searching.
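The cleansing rule can be sketched roughly as below. This is only an illustration, not the project's actual code: the function names `clean_company_name` and `is_valid_company_name` are hypothetical, and "Chinese character" is assumed to mean the CJK Unified Ideographs range U+4E00 to U+9FFF.

```python
import re

# Map full-width parentheses to their half-width equivalents.
FULL_TO_HALF = str.maketrans({"（": "(", "）": ")"})

# Assumption: a valid name holds only CJK Unified Ideographs
# (U+4E00 to U+9FFF) and half-width parentheses.
VALID_NAME = re.compile(r"[\u4e00-\u9fff()]+")


def clean_company_name(name: str) -> str:
    """Trim surrounding whitespace and convert full-width parentheses."""
    return name.strip().translate(FULL_TO_HALF)


def is_valid_company_name(name: str) -> bool:
    """Return True if the (already cleaned) name matches the expected format."""
    return VALID_NAME.fullmatch(name) is not None


if __name__ == "__main__":
    cleaned = clean_company_name("百度（中国）有限公司")
    print(cleaned)                         # → 百度(中国)有限公司
    print(is_valid_company_name(cleaned))  # → True
```

Names that fail the check could be skipped or flagged in the result file rather than searched, which avoids spending a browser request on input that cannot match.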