Explore the above links to see my portfolio projects:
- sampleReddit: a Python library that makes it easy for researchers to collect snowball samples of Reddit data.
- Toxicity Data Collection: My work collecting and cleaning Reddit comments and classifying them by whether they contain toxic content. This project contains an early version of my sampleReddit code. It also deploys a transformer neural network model and showcases robust data cleaning and collection procedures.
- Multilabel Emotion Classification: A project that classifies Reddit comments into six emotion categories using a deep recurrent neural network.
- Back Propagation: My own implementation of the backpropagation algorithm, which computes the gradients that gradient descent uses to train neural networks. This project greatly deepened my understanding of neural network optimization.