Data Enrichment Pipeline

Client: Leadbook Pte Ltd
Tech Stack: Python, requests, Beautiful Soup, Proxy rotator, MongoDB, Apache Kafka
Github URL: Project Preview Link

Developed and maintained a data pipeline for enriching raw data from various sources such as web scraping, third-party APIs, and files using Python, Beautiful Soup, and requests. Implemented proxy rotators for data acquisition and used MongoDB as a data store for the transformed data. Leveraged Apache Kafka for data enrichment and processing.