How to Do Web Scraping Using Python
Web scraping is a technique to extract data from webpage using computer program. In this tutorial we will demonstrate how to collect news links and title from a newspaper website for educational purpose. During the journey we will learn about Python’s built-in URLLIB library to connect website and retrieve data and a third party BeautifulSoup library to parse HTML data so that we can filter and take out our target links. We will also use a built-in logging library to write error in text file if any error happens during the program runs. The main library for web scraping is designed using Object Oriented Programming (OOP) style.
6