NYC Rental Listing Popularity Study (Fall 2018)
This project focus on designing a more reasonable listing mechanism that can predict the interest level (high, medium, low) of rental apartments based on their numerical and descriptive information. We approached it using logistic regression, random forest, and kNN. The results show that random forest has the best performance in predicting the popularity of rental apartments.
The project is the final project for NYU DS-GA 1007 Programming for Data Science
Data
Data comes from kaggle Two Sigma Connect: Rental Listing Inquiries.
Also uploaded, simply download everything and run the Jupyter Notebooks.
Code
Source Code
Packeges used: pandas, numpy, scikit-learn, etc.
All the code is in the 07 project Jupyter Notebook, sections are Data Importing, Data Cleaning, Model Selection and Implement, and some Data VIsualization in the end.
Authors
- Muyang Jin
- Yixuan Wang
- Xin Wu
- Jianzhi Li