Zalando-Beauty-New-Products-Scraper

Zalando-Beauty-New-Products-Scraper is a Python script that scrapes product information from the Zalando website and saves it to CSV files. It lets you collect and monitor new product arrivals in a specific category over time.

Getting Started

Prerequisites

Before using the scraper, make sure the following are installed:

Python 3.x
pip (Python package manager)

Installation

Clone the repository to your local machine:

  git clone https://github.com/SilentJMA/Zalando-Beauty-New-Product-Scraper.git

Change to the project directory:

  cd Zalando-Beauty-New-Product-Scraper

Install the required Python packages:

  pip install -r requirements.txt

Usage

Configuration

Modify the url variable in the script to specify the Zalando category URL you want to scrape.

Define the csv_directory variable to specify the directory where CSV files will be saved.

Set the product_limit variable to limit the number of products to scrape in each run.

Define the import_interval variable to set the interval for running the import (in seconds).
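As a rough illustration, the four settings might sit together near the top of the script as shown here. The variable names come from the description above; every value (the placeholder URL, the directory name, the limit, and the interval) is an assumption for the example, not the script's actual default.

```python
# Hypothetical configuration values; replace each one with your own.

# Category page to scrape. Placeholder only, not a real Zalando URL.
url = "https://example.com/replace-with-zalando-category-url"

# Directory where the CSV files are written (assumed name).
csv_directory = "csv_exports"

# Maximum number of products to scrape per run (assumed value).
product_limit = 50

# Pause between imports, in seconds (assumed: one hour).
import_interval = 60 * 60
```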

Running the Scraper

To run the scraper, execute the script with Python:

  python zalando_beauty_scraper.py

The script scrapes the product information, saves new products to a timestamped CSV file in csv_directory, and then waits import_interval seconds before the next run.

Features

Scrapes product information including brand name, product name, URL, price, and premium delivery status.

Handles German product names and UTF-8 encoding.

Checks for new products and skips existing ones.

Scheduled execution for periodic updates.

CSV Data Storage

Each import creates a new CSV file with a timestamp in the filename. Previous CSV files are retained in the specified directory. The CSV files contain data for new products found during each import.
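The following is a minimal sketch of how a timestamped, UTF-8 encoded CSV file could be written on each import. The function names, the filename pattern, and the column names are assumptions made for illustration; the actual script may name things differently.

```python
import csv
from datetime import datetime
from pathlib import Path

# Assumed column names, one per scraped field.
FIELDNAMES = ["brand", "name", "url", "price", "premium_delivery"]


def timestamped_csv_path(csv_directory, now=None):
    """Build a path such as <csv_directory>/zalando_products_20240102_030405.csv."""
    now = now or datetime.now()
    return Path(csv_directory) / f"zalando_products_{now:%Y%m%d_%H%M%S}.csv"


def write_products(path, products):
    """Write a list of product dicts to a new CSV file, encoded as UTF-8."""
    path = Path(path)
    path.parent.mkdir(parents=True, exist_ok=True)
    # newline="" lets the csv module control line endings itself.
    with path.open("w", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=FIELDNAMES)
        writer.writeheader()
        writer.writerows(products)
```

Because the timestamp is part of the filename, each import writes to a fresh file and earlier files in the directory are left untouched.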

Scraping Product Data

The scraper extracts the following product information:

Brand Name

Product Name (German product names are supported)

Product URL

Product Price

Premium Delivery Status
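One way to picture a single scraped record is as a small data class with one field per item in the list above. The class name, field names, and types here are assumptions for illustration (the price is kept as a string because the source does not say how it is parsed), and the sample values are made up.

```python
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class Product:
    """Hypothetical container for one scraped product."""
    brand: str
    name: str              # may contain German characters such as ä, ö, ü, ß
    url: str
    price: str             # kept as text, e.g. "12,95 €"
    premium_delivery: bool


# Made-up sample record, not real scraped data.
sample = Product(
    brand="ExampleBrand",
    name="Feuchtigkeitscreme für trockene Haut",
    url="https://example.com/product/123",
    price="12,95 €",
    premium_delivery=False,
)

# asdict() yields a plain dict that can be passed to csv.DictWriter.
row = asdict(sample)
```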

Monitoring New Products

The script checks for new products by comparing product names with those from previous imports.
Only new products are saved to the CSV files.
This lets you monitor product arrivals over time and track changes.
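The name comparison described above could be sketched as follows. This is not the script's actual code: the helper names and the assumption that earlier CSV files have a "name" column are hypothetical.

```python
import csv
from pathlib import Path


def load_known_names(csv_directory):
    """Collect every product name found in earlier CSV files."""
    known = set()
    for csv_path in sorted(Path(csv_directory).glob("*.csv")):
        with csv_path.open(newline="", encoding="utf-8") as handle:
            for row in csv.DictReader(handle):
                name = row.get("name")  # assumed column name
                if name:
                    known.add(name)
    return known


def filter_new_products(products, known_names):
    """Keep only products whose name has not been seen before."""
    seen = set(known_names)
    new_products = []
    for product in products:
        if product["name"] not in seen:
            seen.add(product["name"])  # also drops duplicates within one run
            new_products.append(product)
    return new_products
```

A run would then call load_known_names on the CSV directory, pass the freshly scraped products through filter_new_products, and write only the result to the new file.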

Scheduled Execution

The script runs on a schedule defined by the import_interval variable. It will automatically update the product data at the specified interval.
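A simple way to get this behaviour is a loop that runs the import and then sleeps for import_interval seconds. The sketch below assumes that approach; the real script may use a different scheduling mechanism. The function name and the max_runs and sleep parameters are hypothetical and exist only to make the loop easy to stop and test.

```python
import time


def run_on_schedule(job, import_interval, max_runs=None, sleep=time.sleep):
    """Call job() repeatedly, pausing import_interval seconds between runs.

    With max_runs=None the loop runs until the process is stopped.
    """
    runs = 0
    while max_runs is None or runs < max_runs:
        job()
        runs += 1
        if max_runs is not None and runs >= max_runs:
            break  # no need to wait after the final run
        sleep(import_interval)
    return runs
```

For example, run_on_schedule(run_import, import_interval) would repeat a hypothetical run_import function until the process is interrupted with Ctrl+C.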

Contributing

Contributions are welcome! If you have suggestions, improvements, or bug fixes, please open an issue or create a pull request.
