Jelared/Project-GEIPAN


GEIPAN Data Scraper

About

A Python script that scrapes data from the website of GEIPAN (Groupe d'Études et d'Informations sur les Phénomènes Aérospatiaux Non Identifiés). For each UFO sighting it extracts the city, phenomenon, department, case classification, and observation date, then saves the results to a CSV file for further analysis.

Features

  • Data Extraction: Scrape data from multiple pages of the GEIPAN website.
  • Data Processing: Organize the extracted data into a pandas DataFrame.
  • CSV Export: Save the processed data into a CSV file.
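The three features above can be sketched end to end as follows. This is a minimal illustration, not the notebook's actual code: the HTML snippet, the CSS class names, the sample values, and the helper names `parse_cases` and `cases_to_csv` are all assumptions made for the example, and the real GEIPAN pages are likely structured differently.

```python
import io

import pandas as pd
from bs4 import BeautifulSoup

# Hypothetical markup standing in for one results page; the class names
# and values are invented for this sketch.
SAMPLE_HTML = """
<table>
  <tr class="case">
    <td class="city">Ville-A</td>
    <td class="phenomenon">Light in the sky</td>
    <td class="department">Dept-1</td>
    <td class="classification">A</td>
    <td class="date">2015-03-12</td>
  </tr>
  <tr class="case">
    <td class="city">Ville-B</td>
    <td class="phenomenon">Dark object</td>
    <td class="classification">D</td>
    <td class="date">2016-07-01</td>
  </tr>
</table>
"""

FIELDS = ["city", "phenomenon", "department", "classification", "date"]


def parse_cases(html):
    """Extract one dict per case row; missing cells become None."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    for tr in soup.find_all("tr", class_="case"):
        row = {}
        for field in FIELDS:
            cell = tr.find("td", class_=field)
            row[field] = cell.get_text(strip=True) if cell else None
        rows.append(row)
    return rows


def cases_to_csv(rows, target):
    """Build a DataFrame with a fixed column order and write it as CSV."""
    df = pd.DataFrame(rows, columns=FIELDS)
    df.to_csv(target, index=False)
    return df


rows = parse_cases(SAMPLE_HTML)
buffer = io.StringIO()  # stands in for a file path in this sketch
df = cases_to_csv(rows, buffer)
print(buffer.getvalue())
```

In a real run, the HTML would come from `requests.get(...)` for each page, and `target` would be the path chosen in the file dialog.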

Technology

  • Python 3.x
  • BeautifulSoup
  • Requests
  • pandas
  • tkinter (for file dialog)
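Before running the scraper, it may help to confirm that the libraries listed above can be imported. The sketch below is one way to do that with the standard library; the helper name `missing_modules` is an assumption for the example. Note that the `beautifulsoup4` package is imported as `bs4`, and that tkinter ships with most Python installers but is a separate system package on some Linux distributions.

```python
from importlib.util import find_spec

# Import names (not pip package names) for the listed libraries.
REQUIRED_MODULES = ["bs4", "requests", "pandas", "tkinter"]


def missing_modules(names):
    """Return the top-level module names the import system cannot locate."""
    return [name for name in names if find_spec(name) is None]


missing = missing_modules(REQUIRED_MODULES)
if missing:
    print("Missing modules:", ", ".join(missing))
else:
    print("All required modules are available.")
```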

Usage

  1. Clone the Repository
git clone https://github.com/Jelared/Project-GEIPAN.git
cd Project-GEIPAN
  2. Install Dependencies

Install the required Python libraries using pip:

pip install beautifulsoup4 requests pandas
  3. Run the Script

Ensure the notebook GEIPAN_Data_Scraper.ipynb is in your working directory. Then either open it in Jupyter Notebook and run it:

jupyter notebook GEIPAN_Data_Scraper.ipynb

or convert it to a Python script and run that:

jupyter nbconvert --to script GEIPAN_Data_Scraper.ipynb
python GEIPAN_Data_Scraper.py
  4. Export Data

Follow the prompts to select the location to save the CSV file.
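The save prompt mentioned in the last step is presumably a tkinter file dialog, since tkinter is listed under Technology. The sketch below shows one way such a prompt could be written; the function names `ask_save_path` and `with_csv_suffix` and the dialog title are assumptions for illustration, not taken from the notebook.

```python
from pathlib import Path


def with_csv_suffix(path):
    """Append .csv when the chosen file name has no extension."""
    p = Path(path)
    return str(p if p.suffix else p.with_suffix(".csv"))


def ask_save_path():
    """Open a native "Save as" dialog; return None if the user cancels."""
    # Imported here so with_csv_suffix stays usable without a display.
    import tkinter as tk
    from tkinter import filedialog

    root = tk.Tk()
    root.withdraw()  # hide the empty main window, keep only the dialog
    chosen = filedialog.asksaveasfilename(
        defaultextension=".csv",
        filetypes=[("CSV files", "*.csv"), ("All files", "*.*")],
        title="Save GEIPAN data as",
    )
    root.destroy()
    return with_csv_suffix(chosen) if chosen else None


# Example (opens a dialog, so it needs a desktop session):
# path = ask_save_path()
# if path:
#     df.to_csv(path, index=False)
```

Returning `None` on cancel lets the caller skip the export instead of writing to an empty path.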

License

This project is licensed under the MIT License - see the LICENSE file for details.
