PROXIMUS: Software for Summarization of Very High Dimensional Discrete-Valued Datasets PROXIMUS is a software tool for error-bounded approximation of high-dimensional binary attributed datasets based on nonorthogonal decomposition of binary matrices. This tool can be used for analyzing data arising in a variety of domains ranging from commercial to scientific applications. Using a combination of innovative algorithms, novel data structures, and efficient implementation, PROXIMUS computes a concise representation for very large binary matrices, providing insights into common patterns in the rows and columns of the matrix. PROXIMUS has found application in many areas, including association rule mining, DNA microarray analysis, and business analytics. The original release of PROXIMUS is implemented in C and is freely available as open source below. It was also implemented in R within the CBA (Clustering for Business Analytics) by Christian Buchta and Michael Hahsler and in Java by Jaan Ubi.
Keywords for this software
References in zbMATH (referenced in 9 articles )
Showing results 1 to 9 of 9.
- Krivulin, N. K.; Romanova, E. Yu.: On the rank-one approximation of positive matrices using tropical optimization methods (2019)
- Dong, Bo; Lin, Matthew M.; Park, Haesun: Integer matrix approximation and data mining (2018)
- Karapetyan, Daniel; Punnen, Abraham P.; Parkes, Andrew J.: Markov chain methods for the bipartite Boolean quadratic programming problem (2017)
- Schneider, Johannes; Vlachos, Michail: Scalable density-based clustering with quality guarantees using random projections (2017)
- Punnen, Abraham P.; Sripratak, Piyashat; Karapetyan, Daniel: The bipartite unconstrained 0-1 quadratic programming problem: polynomially solvable cases (2015)
- Jiang, Peng; Peng, Jiming; Heath, Michael; Yang, Rui: A clustering approach to constrained binary matrix factorization (2014)
- van Leeuwen, Matthijs; Vreeken, Jilles; Siebes, Arno: Identifying the components (2009) ioport
- Chi, Jie; Koyuturk, Mehmet; Grama, Ananth: CONQUEST: A coarse-grained algorithm for constructing summaries of distributed discrete datasets (2006)
- Koyuturk, Mehmet; Grama, Ananth; Ramakrishnan, Naren: Compression, Clustering, and Pattern Discovery in Very High-Dimensional Discrete-Attribute Data Sets (2005) ioport