Random matrix approach to multivariate categorical data analysis
Correlation and similarity measures are widely used in all the areas of sciences and social sciences. Often the variables are not numbers but are instead qualitative descriptors called categorical data. We define and study similarity matrix, as a measure of similarity, for the case of categorical da...
Published in: | Physical Review E |
---|---|
Main Authors: | , |
Format: | Text |
Language: | unknown |
Published: |
2015
|
Subjects: | |
Online Access: | http://arxiv.org/abs/1503.06559 https://doi.org/10.1103/PhysRevE.92.032130 |
id |
ftarxivpreprints:oai:arXiv.org:1503.06559 |
---|---|
record_format |
openpolar |
spelling |
ftarxivpreprints:oai:arXiv.org:1503.06559 2023-09-05T13:21:29+02:00 Random matrix approach to multivariate categorical data analysis Patil, Aashay Santhanam, M. S. 2015-03-23 http://arxiv.org/abs/1503.06559 https://doi.org/10.1103/PhysRevE.92.032130 unknown http://arxiv.org/abs/1503.06559 Phys. Rev. E 92, 032130 (2015) doi:10.1103/PhysRevE.92.032130 Physics - Data Analysis Statistics and Probability text 2015 ftarxivpreprints https://doi.org/10.1103/PhysRevE.92.032130 2023-08-16T13:36:23Z Correlation and similarity measures are widely used in all the areas of sciences and social sciences. Often the variables are not numbers but are instead qualitative descriptors called categorical data. We define and study similarity matrix, as a measure of similarity, for the case of categorical data. This is of interest due to a deluge of categorical data, such as movie ratings, top-10 rankings and data from social media, in the public domain that require analysis. We show that the statistical properties of the spectra of similarity matrices, constructed from categorical data, follow those from random matrix theory. We demonstrate this approach by applying it to the data of Indian general elections and sea level pressures in North Atlantic ocean. Comment: 6 pages, 7 figures Text North Atlantic ArXiv.org (Cornell University Library) Indian Physical Review E 92 3 |
institution |
Open Polar |
collection |
ArXiv.org (Cornell University Library) |
op_collection_id |
ftarxivpreprints |
language |
unknown |
topic |
Physics - Data Analysis Statistics and Probability |
spellingShingle |
Physics - Data Analysis Statistics and Probability Patil, Aashay Santhanam, M. S. Random matrix approach to multivariate categorical data analysis |
topic_facet |
Physics - Data Analysis Statistics and Probability |
description |
Correlation and similarity measures are widely used in all the areas of sciences and social sciences. Often the variables are not numbers but are instead qualitative descriptors called categorical data. We define and study similarity matrix, as a measure of similarity, for the case of categorical data. This is of interest due to a deluge of categorical data, such as movie ratings, top-10 rankings and data from social media, in the public domain that require analysis. We show that the statistical properties of the spectra of similarity matrices, constructed from categorical data, follow those from random matrix theory. We demonstrate this approach by applying it to the data of Indian general elections and sea level pressures in North Atlantic ocean. Comment: 6 pages, 7 figures |
format |
Text |
author |
Patil, Aashay Santhanam, M. S. |
author_facet |
Patil, Aashay Santhanam, M. S. |
author_sort |
Patil, Aashay |
title |
Random matrix approach to multivariate categorical data analysis |
title_short |
Random matrix approach to multivariate categorical data analysis |
title_full |
Random matrix approach to multivariate categorical data analysis |
title_fullStr |
Random matrix approach to multivariate categorical data analysis |
title_full_unstemmed |
Random matrix approach to multivariate categorical data analysis |
title_sort |
random matrix approach to multivariate categorical data analysis |
publishDate |
2015 |
url |
http://arxiv.org/abs/1503.06559 https://doi.org/10.1103/PhysRevE.92.032130 |
geographic |
Indian |
geographic_facet |
Indian |
genre |
North Atlantic |
genre_facet |
North Atlantic |
op_relation |
http://arxiv.org/abs/1503.06559 Phys. Rev. E 92, 032130 (2015) doi:10.1103/PhysRevE.92.032130 |
op_doi |
https://doi.org/10.1103/PhysRevE.92.032130 |
container_title |
Physical Review E |
container_volume |
92 |
container_issue |
3 |
_version_ |
1776202092322488320 |