A comprehensive analysis and price prediction system for Disney Resorts hotels worldwide
This project combines traditional data analysis with machine learning to analyze Disney resort hotels globally. It features real-time data scraping, price prediction using PyTorch, and comprehensive statistical analysis. The system can analyze pricing strategies, occupancy patterns, customer satisfaction, and provide future price predictions.
disney-resorts-hotels/
│
├── data-analysis-disney-hotels.py # Original analysis script
├── disney_hotel_ml_predictor.py # ML-based price prediction script
├── requirements.txt # Project dependencies
├── .gitignore # Git ignore file
├── license.md # License information
│
├── data/ # Data directory
│ ├── raw/ # Raw scraped data
│ └── processed/ # Processed datasets
│
├── models/ # Saved ML models
│ └── trained_models/ # Trained model checkpoints
│
├── tests/ # Test directory
│ └── test_scraper.py # Scraper tests
│ └── test_predictor.py # Predictor tests
│
└── logs/ # Logging directory
└── scraping_logs/ # Web scraping logs
- Python 3.8+
- Required packages:
numpy>=1.21.0
pandas>=1.3.0
matplotlib>=3.4.0
seaborn>=0.11.0
scipy>=1.7.0
scikit-learn>=0.24.0
beautifulsoup4>=4.9.3
requests>=2.25.1
pytorch>=2.0.0
selenium>=4.1.0
webdriver_manager>=3.8.0
python-dateutil>=2.8.2
tqdm>=4.65.0
-
Clone the repository:
git clone https://github.com/yourusername/disney-resorts-hotels.git cd disney-resorts-hotels
-
Create and activate a virtual environment:
python -m venv venv source venv/bin/activate # On Windows: venv\Scripts\activate
-
Install dependencies:
pip install -r requirements.txt
The project offers two main scripts:
- Traditional Analysis:
python data-analysis-disney-hotels.py
- ML-based Price Prediction:
python disney_hotel_ml_predictor.py
- Automated web scraping of Disney resort websites
- Real-time price and availability data
- Historical data collection and storage
- Rate limiting and error handling
- Statistical analysis of hotel metrics
- Price trend analysis
- Occupancy patterns
- Seasonal variations
- Special event impact analysis
- Customer satisfaction correlation
- Neural network-based price prediction
- Feature engineering for ML models
- Real-time price forecasting
- Model performance evaluation
- GPU acceleration support
- Automated model retraining
- Interactive visualizations
- Price trend charts
- Occupancy heat maps
- Satisfaction correlation matrices
- Performance metrics dashboards
The system can be customized through various configuration options:
- Scraping Parameters:
scraper_config = {
'delay': 2, # Seconds between requests
'max_retries': 3,
'timeout': 30
}
- Model Parameters:
model_config = {
'learning_rate': 0.001,
'batch_size': 32,
'epochs': 100,
'hidden_layers': [64, 32]
}
The system generates:
- Processed datasets in CSV format
- Trained ML models
- Detailed analysis reports
- Visualization plots
- Performance metrics
- Log files
- Fork the repository
- Create a feature branch (
git checkout -b feature/AmazingFeature
) - Commit your changes (
git commit -m 'Add some AmazingFeature'
) - Push to the branch (
git push origin feature/AmazingFeature
) - Open a Pull Request
This project is licensed under the Apache-2.0 license - see the LICENSE.md file for details.
- Implementation of advanced ML architectures
- Addition of more Disney resorts
- Integration with booking systems
- Real-time price alerts
- Mobile app development
- API endpoint creation
- Enhanced visualization options
Common issues and solutions:
- Scraping errors: Check network connection and update selectors
- Model training issues: Verify GPU availability and memory usage
- Data processing errors: Validate input data format and completeness
For questions and support:
- Open an issue on GitHub
- Check existing documentation
- Review closed issues for solutions
- Disney resorts data architecture
- PyTorch community
- Web scraping best practices
- Hospitality industry insights