F21-STAT 441/841 CM 763-Proposal: Difference between revisions

From statwiki
Jump to navigation Jump to search
No edit summary
No edit summary
Line 179: Line 179:
Kahn, Solomon  
Kahn, Solomon  


'''Title:''' Spotify Recommendation Engine
'''Title:''' Increasing Spotify user engagement through predictive personalization


'''Description:'''  
'''Description:'''  

Revision as of 17:59, 7 October 2020

Use this format (Don’t remove Project 0)

Project # 0 Group members:

Last name, First name

Last name, First name

Last name, First name

Last name, First name

Title: Making a String Telephone

Description: We use paper cups to make a string phone and talk with friends while learning about sound waves with this science project. (Explain your project in one or two paragraphs).


Project # 1 Group members:

Song, Quinn

Loh, William

Bai, Junyue

Choi, Phoebe

Title: APTOS 2019 Blindness Detection

Description:

Our team chose the APTOS 2019 Blindness Detection Challenge from Kaggle. The goal of this challenge is to build a machine learning model that detects diabetic retinopathy by screening retina images.

Millions of people suffer from diabetic retinopathy, the leading cause of blindness among working-aged adults. It is caused by damage to the blood vessels of the light-sensitive tissue at the back of the eye (retina). In rural areas where medical screening is difficult to conduct, it is challenging to detect the disease efficiently. Aravind Eye Hospital hopes to utilize machine learning techniques to gain the ability to automatically screen images for disease and provide information on how severe the condition may be.

Our team plans to solve this problem by applying our knowledge in image processing and classification.



Project # 2 Group members:

Li, Dylan

Li, Mingdao

Lu, Leonie

Sharman,Bharat

Title: Risk prediction in life insurance industry using supervised learning algorithms

Description:

In this project, we aim to replicate and possibly improve upon the work of Jayabalan et al. in their paper “Risk prediction in life insurance industry using supervised learning algorithms”. We will be using the Prudential Life Insurance Data Set that the authors have used and have shared with us. We will be pre-processing the data to replace missing values, using feature selection using CFS and feature reduction using PCA use this processed data to perform Classification via four algorithms – Neural Networks, Random Tree, REPTree and Multiple Linear Regression. We will compare the performance of these Algorithms using MAE and RMSE metrics and come up with visualizations that can explain the results easily even to a non-quantitative audience.

Our goal behind this project is to learn applying the algorithms that we learned in our class to an industry dataset and come up with results that we can aid better, data-driven decision making.


Project # 3 Group members:

Parco, Russel

Sun, Scholar

Yao, Jacky

Zhang, Daniel

Title: Lyft Motion Prediction for Autonomous Vehicles

Description:

Our team has decided to participate in the Lyft Motion Prediction for Autonomous Vehicles Kaggle competition. The aim of this competition is to build a model which given a set of objects on the road (pedestrians, other cars, etc), predict the future movement of these objects.

Autonomous vehicles (AVs) are expected to dramatically redefine the future of transportation. However, there are still significant engineering challenges to be solved before one can fully realize the benefits of self-driving cars. One such challenge is building models that reliably predict the movement of traffic agents around the AV, such as cars, cyclists, and pedestrians.

Our aim is to apply classification techniques learned in class to optimally predict how these objects move.


Project # 4 Group members:

Chow, Jonathan

Dharani, Nyle

Nasirov, Ildar

Title: Classification with Abstinence

Description:

We seek to implement the algorithm described in Deep Gamblers: Learning to Abstain with Portfolio Theory. The paper describes augmenting classification problems to include the option of abstaining from making a prediction when confidence is low.

Medical imaging diagnostics is a field in which classification could assist professionals and improve life expectancy for patients through increased accuracy. However, there are also severe consequences to incorrect predictions. As such, we also hope to apply the algorithm implemented to the classification of medical images, specifically instances of normal and pneumonia chest x-rays.


Project # 5 Group members:

Jones, Hayden

Leung, Michael

Haque, Bushra

Mustatea, Cristian

Title: Combine Convolution with Recurrent Networks for Text Classification

Description:

Our team chose to reproduce the paper Combine Convolution with Recurrent Networks for Text Classification on Arxiv. The goal of this paper is to combine CNN and RNN architectures in a way that more flexibly combines the output of both architectures other than simple concatenation through the use of a “neural tensor layer” for the purpose of improving at the task of text classification. In particular, the paper claims that their novel architecture excels at the following types of text classification: sentiment analysis, news categorization, and topical classification. Our team plans to recreate this paper by working in pairs of 2, one pair to implement the CNN pipeline and the other pair to implement the RNN pipeline. We will be working with Tensorflow 2, Google Collab, and reproducing the paper’s experimental results with training on the same 6 publicly available datasets found in the paper.


Project # 6 Group members:

Chin, Ruixian

Ong, Jason

Chiew, Wen Cheen

Tan, Yan Kai

Title: Mechanisms of Action (MoA) Prediction

Description:

Our team chose to participate in a Kaggle research challenge "Mechanisms of Action (MoA) Prediction". This competition is a project within the Broad Institute of MIT and Harvard, the Laboratory for Innovation Science at Harvard (LISH), and the NIH Common Funds Library of Integrated Network-Based Cellular Signatures (LINCS), present this challenge with the goal of advancing drug development through improvements to MoA prediction algorithms.


Project # 7 Group members:

Ren, Haotian

Cheung, Ian Long Yat

Hussain, Swaleh

Zahid, Bin, Haris

Title: Transaction Fraud Detection

Description:

Protecting people from fraudulent transactions is an important topic for all banks and internet security companies. This Kaggle project is based on the dataset from IEEE Computational Intelligence Society (IEEE-CIS). Our objective is to build a more efficient model in order to recognize each fraud transaction with a higher accuracy and higher speed.


Project # 8 Group members:

ZiJie, Jiang

Yawen, Wang

DanMeng, Cui

MingKang, Jiang

Title: Lyft Motion Prediction for Autonomous Vehicles

Description:

Our team chose to participate in the Kaggle Challenge "Lyft Motion Prediction for Autonomous Vehicles". We will apply our science skills to build motion prediction models for self-driving vehicles. The model will be able to predict the movement of traffic agents around the AV, such as cars, cyclists, and pedestrians. The goal of this competition is to predict the trajectories of other traffic participants.



Project # 9 Group members:

Banno, Dion

Battista, Joseph

Kahn, Solomon

Title: Increasing Spotify user engagement through predictive personalization

Description:

Our project is an application of classification to the domain of predictive personalization. The goal of the project is to increase Spotify user engagement through data-driven methods. Given a set of users’ demographic data, listening preferences and behaviour, our goal is to build a recommendation system that suggests new songs to users. From a potential pool of songs to suggest, the final song recommendations will be driven by a classification algorithm that measures a given user’s propensity to like a song. We plan on leveraging the Spotify Web API to gather data about songs.