This project collection leverages feature engineering, data preprocessing, and supervised learning models to achieve accurate predictions.
- Data Preprocessing: Handles missing data, encodes categorical variables, and scales numerical features for optimal model performance.
- Feature Engineering: Extracts relevant features to improve prediction accuracy.
- Modeling: Trains and evaluates supervised learning models to predict behaviors.
- Visualization: Includes plots to analyze data trends and model performance.
To run the notebook, you need the following dependencies:
- Python 3.7+
- Jupyter Notebook or JupyterLab
- pandas
- numpy
- scikit-learn
- matplotlib
- seaborn
Install the required libraries using the following command:
pip install -r requirements.txt
-
Clone the repository:
git clone https://github.com/yourusername/airbnb-availability-predictor.git
-
Navigate to the project directory:
cd airbnb-availability-predictor
-
Launch Jupyter Notebook:
jupyter notebook Listings_Availability.ipynb
-
Run the cells in the notebook sequentially to:
- Load and preprocess the data
- Perform exploratory data analysis (EDA)
- Train and evaluate machine learning models
The notebook provides:
- A detailed analysis of features impacting the models' ability to make predictions.
- Performance metrics (e.g., accuracy, precision, recall) of the trained models.
- Visualizations to understand data distribution and model predictions.
Contributions are welcome! If you have ideas for improving the project or adding new features, feel free to submit a pull request.
This project is licensed under the MIT License. See the LICENSE
file for details.
Thank you to my mentors at MIT for their patience and support during this learning process!