Skip to content

DaniilRoman/avito_data_mining

Repository files navigation

Project for analyze rent of real estate at Nizhny Novgorod


Project has 2 part.

  1. Web scraping and data processing.
  2. Visualizing of basic statistics with assumptions.

Data retrieval

Data has been scrabed from Avito.ru and enriached using yandex geocoder and open street map

For building ETL pipline Airflow has been used.

for our purposes Airflow is overkill because we have just sequenced pipeline but for future it is convenient way to increase complexity and using parallel processing

Data processing DAG located in here and looks like that: Graph structure Admin panel

Airflow docker

For running airflow used docker-compose from github.com/puckel with little bit changes for import necessary dependencies from requirements.txt

Visualizing

Jupyter notebook with data analysis

Prices by longitude and latitude 2019 vs 2020 prices Prices by apartment type

and 2019 data analysis with some machine learning in here

final data in here

Releases

No releases published

Packages

No packages published

Languages