Subspace clustering aims to sort mixed data by uncovering their hidden linear patterns — like sorting fabrics by the weave hidden beneath their colors and designs.
Unfortunately, missing data is ubiquitous in modern applications. For example, medical records often have incomplete test results, sensors in engineering systems can fail to record measurements, and surveys or social science studies frequently contain unanswered questions.
The challenge with missing data is that many linear patterns may fit — like sorting fabrics by weave when you only have a few loose threads from each one.
We investigate the theoretical limits that determine when this type of clustering is possible — how many threads we need — and we design practical algorithms that can cluster efficiently.
Publications |
---|
A. Soni, J. Linderoth, J. Luedtke, and D. Pimentel-Alarcón. "An integer programming approach to subspace clustering with missing data". INFORMS Journal on Optimization. 2025. [Link] |
S. Baskar, K. Rupashree, and D. Pimentel-Alarcón. "Deep union completion". AAAI Conference on Artificial Intelligence. 2025. [Link] |
Huanran Li, D. Pimentel-Alarcón. "Deep fusion: Capturing dependencies in contrastive learning via transformer projection heads". IEEE International Symposium on Information Theory (ISIT). 2025. [Link] |
K. Rupashree, S. Baskar, and D. Pimentel-Alarcón. "Latent union completion". IEEE International Symposium on Information Theory (ISIT). 2025. [Link] |
H. Li and D. Pimentel-Alarcón. "Group-sparse subspace clustering with elastic stars". IEEE International Symposium on Information Theory (ISIT). 2024. [Link] |
H. Li, J. Johnson, and D. Pimentel-Alarcón. "Fusion over the grassmannian for high-rank matrix completion". IEEE International Symposium on Information Theory (ISIT). 2024. [Link] |
H. Li and D. Pimentel-Alarcón. "Visualizing grassmannians via poincare embeddings". International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP). 2023. Best student paper award. [Link] |
B. Kizaric and D. Pimentel-Alarcón. "Principal component trees and their persistent homology". AAAI Conference on Artificial Intelligence. 2023. [Link] |
U. Mahmood and D. Pimentel-Alarcón. "Fusion subspace clustering for incomplete data". International Joint Conference on Neural Networks (IJCNN). 2022. [Link] |
B. Kizaric and D. Pimentel-Alarcón. "Classifying incomplete data with a mixture of subspace experts". Allerton Conference on Communication, Control, and Computing. 2022. [Link] |
A. Soni, J. Linderoth, J. Luedtke, and D. Pimentel-Alarcón. "Integer programming approaches to subspace clustering with missing data". Optimization for Machine Learning, NeurIPS. 2021. [Link] |
G. Ongie, D. Pimentel-Alarcón, L. Balzano, R. Willett, and R. Nowak. "Tensor methods for nonlinear matrix completion". SIAM Journal on Mathematics of Data Science. 2021. [Link] |
D. Pimentel-Alarcón, G. Ongie, L. Balzano, R. Willett, and R. Nowak. "Low algebraic dimension matrix completion". Allerton Conference on Communication, Control, and Computing. 2017. [Link] |