ONLINE WORKSHOP: “SCRAPING” OF INFORMATION FROM WEB PAGES – CASE STUDY: BUSINESS REGISTRATION AGENCY

25 March 2021

In the following workshop organized by AAB College, we will be introduced to the techniques of “scraping” information from online websites.

Web Crawling (spider / spiderbot) / Web Scraping software is constantly used by search engines such as Google, Bing, Yahoo, Lycos, allowing them to collect information from millions of web pages and create giant databases. These methods are also used by various individuals for malicious purposes in the form of viruses.

In this workshop, the technique of “scraping” information will be explained in more detail and demonstrated practically by using simple methods and tools. 

The data collected by this method can then be used to create structured data (e.g., SQL Database).

For the needs of the demonstration, the official website of the Business Registration Agency – arbk.rks-gov.net has been selected, in which the data of nearly 200 thousand businesses can be found!

The official website of the Business Registration Agency: arbk.rks-gov.net which contains the data of nearly 200,000 businesses, was chosen as the site for the demonstration of the scraping technique. 

Those interested can apply via email: [email protected], on Monday, March 29, 20:00.

Share: