A Python script for scraping and monitoring new product arrivals on Zalando, a popular online fashion retailer.

SilentJMA/Zalando-Beauty-New-Products-Scraper

Zalando-Beauty-New-Products-Scraper

Zalando Beauty New Product Scraper is a Python script for scraping product information from the Zalando website and saving it to a CSV file. It allows you to collect and monitor new product arrivals in a specific category over time.

Getting Started

Prerequisites

Before using the Zalando Product Scraper, ensure you have the following prerequisites installed:

  • Python 3.x
  • pip (Python package manager)

Installation

  1. Clone the repository to your local machine:
     git clone https://github.com/SilentJMA/Zalando-Beauty-New-Product-Scraper.git
  2. Change to the project directory:
     cd Zalando-Beauty-New-Product-Scraper
  3. Install the required Python packages:
     pip install -r requirements.txt

Usage

Configuration

  • Modify the url variable in the script to specify the Zalando category URL you want to scrape.
  • Define the csv_directory variable to specify the directory where CSV files will be saved.
  • Set the product_limit variable to limit the number of products to scrape in each run.
  • Define the import_interval variable to set the interval for running the import (in seconds).
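The four settings above might look like the following at the top of the script. This is a minimal sketch, not the script's actual contents: the variable names come from this README, but every value is a placeholder, and `validate_config` is a hypothetical helper added here only to show which values make sense.

```python
from pathlib import Path

# Variable names follow the README; all values below are placeholders.
url = "https://example.com/category"   # replace with the Zalando category URL
csv_directory = Path("csv_exports")    # directory where CSV files are saved
product_limit = 50                     # maximum products scraped per run
import_interval = 3600                 # seconds between imports


def validate_config(url, csv_directory, product_limit, import_interval):
    """Hypothetical helper: reject obviously unusable settings."""
    if not str(url).startswith(("http://", "https://")):
        raise ValueError("url must be an http(s) URL")
    if product_limit <= 0:
        raise ValueError("product_limit must be positive")
    if import_interval <= 0:
        raise ValueError("import_interval must be positive (seconds)")
    return Path(csv_directory)


output_dir = validate_config(url, csv_directory, product_limit, import_interval)
```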
Running the Scraper

To run the scraper, execute the script with Python:

    python zalando_beauty_scraper.py

The script scrapes the product information, saves it to a CSV file in the specified directory, and waits for the next scheduled run.

Features

  • Scrapes product information including brand name, product name, URL, and price.
  • Handles German product names and UTF-8 encoding.
  • Checks for new products and skips existing ones.
  • Scheduled execution for periodic updates.
CSV Data Storage

Each import creates a new CSV file with a timestamp in its filename. Earlier CSV files are kept in the specified directory. Each file contains only the products that were new at the time of that import.
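One way to produce a timestamped, UTF-8 encoded CSV file per import is sketched below. It assumes products are dictionaries; the column names, the `zalando_products_` filename prefix, and the `write_import` function are illustrative choices, not taken from the script.

```python
import csv
from datetime import datetime
from pathlib import Path

# Assumed column names, based on the fields this README lists.
FIELDNAMES = ["brand", "name", "url", "price", "premium_delivery"]


def write_import(products, csv_directory, now=None):
    """Write one import to a new timestamped CSV file and return its path."""
    now = now or datetime.now()
    directory = Path(csv_directory)
    directory.mkdir(parents=True, exist_ok=True)
    # The timestamp makes each filename distinct, so earlier files are kept.
    path = directory / f"zalando_products_{now:%Y%m%d_%H%M%S}.csv"
    # utf-8 keeps German characters such as umlauts intact;
    # newline="" lets the csv module control line endings.
    with path.open("w", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=FIELDNAMES)
        writer.writeheader()
        writer.writerows(products)
    return path
```

Because the timestamp has one-second resolution, two imports within the same second would write to the same file; with an interval measured in seconds or longer, this should not occur in practice.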

Scraping Product Data

The scraper extracts the following product information:

  • Brand Name
  • Product Name (German product names are supported)
  • Product URL
  • Product Price
  • Premium Delivery Status
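The fields listed above could be held in a small record type before being written out. This is a sketch under assumptions: the `Product` class, its attribute names, and the example values are hypothetical, and the price is kept as the scraped text because this README does not say how prices are parsed.

```python
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class Product:
    """Hypothetical record for one scraped product."""
    brand: str
    name: str               # may contain German characters
    url: str
    price: str              # kept as scraped text, not parsed to a number
    premium_delivery: bool


# Illustrative values only; not real scraped data.
example = Product(
    brand="Beispielmarke",
    name="Feuchtigkeitscreme für trockene Haut",
    url="https://example.com/p/1",
    price="12,95 €",
    premium_delivery=False,
)
row = asdict(example)  # plain dict, suitable for csv.DictWriter
```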
Monitoring New Products

The script checks for new products by comparing product names with previous imports.
Only new products are saved to the CSV files.
You can monitor product arrivals over time and track changes.
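The name comparison could be done roughly as follows: collect every product name from the CSV files of earlier imports, then keep only products whose name has not been seen. This is a sketch, not the script's code; `load_seen_names`, `filter_new`, and the `name` column are assumed names.

```python
import csv
from pathlib import Path


def load_seen_names(csv_directory, name_column="name"):
    """Collect product names from every CSV file of earlier imports."""
    seen = set()
    for path in sorted(Path(csv_directory).glob("*.csv")):
        with path.open(newline="", encoding="utf-8") as handle:
            for row in csv.DictReader(handle):
                name = row.get(name_column)
                if name:
                    seen.add(name)
    return seen


def filter_new(products, seen_names, name_column="name"):
    """Return products whose name is not in seen_names, preserving order."""
    known = set(seen_names)  # copy, so the caller's set is not modified
    new_products = []
    for product in products:
        name = product[name_column]
        if name not in known:
            known.add(name)  # also drops duplicates within the same run
            new_products.append(product)
    return new_products
```

Matching on the name alone means that two different products sharing a name would be treated as one; comparing on the product URL would be a stricter alternative.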

Scheduled Execution

The script runs on a schedule defined by the import_interval variable. It will automatically update the product data at the specified interval.
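A simple way to get this behaviour is a loop that runs one import and then sleeps for import_interval seconds. The sketch below assumes that approach; the script may use a different mechanism, and `run_on_schedule` with its `max_runs` and `sleep` parameters is hypothetical (both exist mainly so the loop can be tried without waiting).

```python
import time


def run_on_schedule(import_once, import_interval, max_runs=None, sleep=time.sleep):
    """Call import_once repeatedly, pausing import_interval seconds between runs.

    max_runs=None loops until interrupted; a number stops after that many runs.
    """
    runs = 0
    while max_runs is None or runs < max_runs:
        import_once()
        runs += 1
        # Skip the pause after the final run so the function returns promptly.
        if max_runs is None or runs < max_runs:
            sleep(import_interval)
    return runs
```

With this shape, calling `run_on_schedule(do_import, import_interval)` would repeat until the process is stopped, for example with Ctrl+C, where `do_import` stands for whatever function performs one scrape.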

Contributing

Contributions are welcome! If you have suggestions, improvements, or bug fixes, please open an issue or create a pull request.