s13Stat946proposal: Difference between revisions
No edit summary |
|||
Line 24: | Line 24: | ||
Finding a low-rank decomposition of a matrix is a well-known problem and has many applications. Sometimes, there are some additional constraints on this problem. For instance, we may require that the decomposition is sparse. In [http://cbio.ensmp.fr/~jvert/svn/bibli/local/Witten2009penalized.pdf], an efficient method is suggested to solve this problem. In particular, some regularization terms (e.g., L1-norms) are added to the optimization problem to guarantee the sparsity. After employing some relaxations, the problem is casted as a biconvex optimization, and is solved iteratively, similar to the power method (but along with a soft-thresholding operator). | Finding a low-rank decomposition of a matrix is a well-known problem and has many applications. Sometimes, there are some additional constraints on this problem. For instance, we may require that the decomposition is sparse. In [http://cbio.ensmp.fr/~jvert/svn/bibli/local/Witten2009penalized.pdf], an efficient method is suggested to solve this problem. In particular, some regularization terms (e.g., L1-norms) are added to the optimization problem to guarantee the sparsity. After employing some relaxations, the problem is casted as a biconvex optimization, and is solved iteratively, similar to the power method (but along with a soft-thresholding operator). | ||
In this project, we want to employ this idea, and develop new algorithms to deal with other type of constraints. Of particular interest, is the group-sparsity constraint on the decomposition. The difficulty comes from the fact that the group-sparsity constraint can bind the problem of selecting different basis-vectors together. Therefore, we cannot select basis-vectors one at a time, but we need to select them together. | In this project, we want to employ this idea, and develop new algorithms to deal with other type of constraints. Of particular interest, is the group-sparsity constraint on the decomposition, that has applications in feature selection algorithms. The difficulty comes from the fact that the group-sparsity constraint can bind the problem of selecting different basis-vectors together. Therefore, we cannot select basis-vectors one at a time, but we need to select them together. Furthermore, the nature of the constraints (e.g., infinity-norms) makes it difficult to find a closed-form solution, whereas for L1-norm we could find one. These facts make finding an efficient solution hard. In this research, we want to see if we can propose a solution to overcome these difficulties. | ||
O |
Revision as of 23:06, 8 July 2013
Project 1 : How to Build a Bird House
By: Maseeh Ghodsi, Soroush Ghodsi and Ali Ghodsi
Project: Dimension Reduction for NBA data
By: Lu Xin and Jiaxi Liang
National Basketball Association (NBA) is one of the biggest sports leagues in North America. Thanks to advanced techniques of data collection, very detailed statistics are available for teams, players, and games throughout the seasons. One important goal of basketball data analysis is to evaluate the performances of teams, lineups and players. The team performances can be easily seen from the rankings of the teams. However, the performance of lineups (5-player combinations) is not so obvious. Furthermore, it is complicated to evaluate players in terms of team playing. For example, Kobe Bryant is certainly a great individual player, but is he a good team player? In this project, we try to apply dimension reduction approaches to deal with such problems. The basic idea is to find low-dimensional representations of teams and lineups. Hopefully, the pattern of the teams and lineups in the latent space can lead to interesting conclusions.
Firstly, we will select dimension reduction approaches by applying them on team statistics. We may consider all possible approaches, linear and nonlinear ones, supervised and unsupervised ones. Since the overall performances of the teams (their ranking) are known, we can choose the methods that yield visualizations that agree with the conclusions drawn from expert knowledge, for instance we expect a clear separation between teams with offensive and defensive styles.
Secondly, we apply the selected methods to lineup data sets and get the plots of the lineups in the low-dimensional space. From the patterns, we may see some interesting team structures. By comparing the lineups with and without certain players, we can tell the effect of such players in the team. We will only focus on important lineups and players.
Project: Large-Scale Supervised Sparse Principal Component Analysis
By: Wu Lin and Wei Wang
One of important issues of dimension reduction technique is scalability when it comes to real-world applications. Recently, there is some published work to address the issue. In the paper[1], the authors proposed a fast and large-scale algorithm to implement Sparse Principal Component Analysis. In our project, we would like to extend this algorithm to supervised version by introducing some supervised metric to the optimization framework such as Hilbert-Schmidt independence criterion (HSIC)
Project: Efficient Group-Sparsed Matrix Decompostion
By: M.Hassan Z.Ashtiani
Finding a low-rank decomposition of a matrix is a well-known problem and has many applications. Sometimes, there are some additional constraints on this problem. For instance, we may require that the decomposition is sparse. In [2], an efficient method is suggested to solve this problem. In particular, some regularization terms (e.g., L1-norms) are added to the optimization problem to guarantee the sparsity. After employing some relaxations, the problem is casted as a biconvex optimization, and is solved iteratively, similar to the power method (but along with a soft-thresholding operator).
In this project, we want to employ this idea, and develop new algorithms to deal with other type of constraints. Of particular interest, is the group-sparsity constraint on the decomposition, that has applications in feature selection algorithms. The difficulty comes from the fact that the group-sparsity constraint can bind the problem of selecting different basis-vectors together. Therefore, we cannot select basis-vectors one at a time, but we need to select them together. Furthermore, the nature of the constraints (e.g., infinity-norms) makes it difficult to find a closed-form solution, whereas for L1-norm we could find one. These facts make finding an efficient solution hard. In this research, we want to see if we can propose a solution to overcome these difficulties.
O