← Back to projects

Amazgo

REST APIs for scraping Amazon product data and persisting the results to MongoDB.

2020

Overview

Description

Amazgo exposes HTTP endpoints that scrape Amazon pages with Colly, normalize product fields such as title, image, price, and reviews, and store the results with timestamps in MongoDB.

Problem solved

Product extraction often starts as one-off scripts, which makes repeated scraping, structured persistence, and local reproducibility harder than it needs to be.

Role and contributions

Built the API handlers, scraping flow, and local development setup for structured product collection.

  • Implemented scrape and product endpoints for collecting page data
  • Mapped extracted fields into a stored document model with timestamps
  • Added Docker-based bootstrapping for the service and database