Overview | Dataset | Notebooks | Technologies | Paper | Contribution
A simple web application to predict crime probability in New York City depending on user information, location and time.
This work relies on NYPD Complaint Data Historic Dataset. This dataset includes all valid felony, misdemeanor, and violation crimes reported to the New York City Police Department (NYPD) from 2006 to 2019. The dataset contains 6901167 complaints and 35 columns including spatial and temporal information about crime occurrences along with their description and penal classification.
Different notebooks are provided for :
This web application is developd using:
- Streamlit
- Folium
For data cleaning, EDA and modeling:
- Pandas
- seaborn
- matplotlib
- Scikit-learn
- lightgbm
To install requirements run:
pip install -r requirements.txt
To document our work, we wrote a research paper that is included in the repo.