Implementation of a Combined Web Crawler and Scraper for General Applications

TITLE	Implementation of a Combined Web Crawler and Scraper for General Applications
ABSTRACT	The goal is to build a smart internet tool that acts like a highly focused research assistant, automatically finding and organizing specific information from across the web. To do this efficiently, it uses promising links to focus on valuable content. Once it visits a page, it employs adaptable techniques to extract exact data, such as prices or reviews, and organizes it into a structured format. The end result is a powerful and flexible system that supports a wide range of uses, from tracking competitor prices to collecting data for academic research. It is designed to navigate common web challenges, being polite to servers, avoiding duplicate information, and handling dynamic content. Crucially, the system is architected to be smart and respectful, empowering users to make informed decisions based on publicly available web data.
AUTHOR	Mohammed Zaki Zaheer Khan, M. Gopi Vardhan, U. Rakesh, Nama Vikram, K. Anand Department of Computer Science and Engineering, Spoorthy Engineering College, Hyderabad, India
VOLUME	13
DOI	DOI:10.15680/IJARETY.2026.1302017
PDF	17_Implementation of a Combined Web Crawler and Scraper for General Applications.pdf
KEYWORDS
References	[1] F. Menczer, G. Pant, and P. Srinivasan, â€œWeb Crawling,â€ 2003. [2] â€œPlaywright Documentation,â€ https://playwright.dev/. [3] â€œMySQL Official Documentation,â€ https://dev.mysql.com/doc/.

Article