Data Enrichment Pipeline
- Client: Leadbook Pte Ltd
- Tech Stack: Python, requests, Beautiful Soup, Proxy rotator, MongoDB, Apache Kafka
- Github URL: Project Preview Link
Developed and maintained a data pipeline for enriching raw data from various sources such as web scraping, third-party APIs, and files using Python, Beautiful Soup, and requests. Implemented proxy rotators for data acquisition and used MongoDB as a data store for the transformed data. Leveraged Apache Kafka for data enrichment and processing.