Random matrix approach to multivariate categorical data analysis

Correlation and similarity measures are widely used in all the areas of sciences and social sciences. Often the variables are not numbers but are instead qualitative descriptors called categorical data. We define and study similarity matrix, as a measure of similarity, for the case of categorical da...

Full description

Bibliographic Details
Published in:Physical Review E
Main Authors: Patil, Aashay, Santhanam, M. S.
Format: Text
Language:unknown
Published: 2015
Subjects:
Online Access:http://arxiv.org/abs/1503.06559
https://doi.org/10.1103/PhysRevE.92.032130
id ftarxivpreprints:oai:arXiv.org:1503.06559
record_format openpolar
spelling ftarxivpreprints:oai:arXiv.org:1503.06559 2023-09-05T13:21:29+02:00 Random matrix approach to multivariate categorical data analysis Patil, Aashay Santhanam, M. S. 2015-03-23 http://arxiv.org/abs/1503.06559 https://doi.org/10.1103/PhysRevE.92.032130 unknown http://arxiv.org/abs/1503.06559 Phys. Rev. E 92, 032130 (2015) doi:10.1103/PhysRevE.92.032130 Physics - Data Analysis Statistics and Probability text 2015 ftarxivpreprints https://doi.org/10.1103/PhysRevE.92.032130 2023-08-16T13:36:23Z Correlation and similarity measures are widely used in all the areas of sciences and social sciences. Often the variables are not numbers but are instead qualitative descriptors called categorical data. We define and study similarity matrix, as a measure of similarity, for the case of categorical data. This is of interest due to a deluge of categorical data, such as movie ratings, top-10 rankings and data from social media, in the public domain that require analysis. We show that the statistical properties of the spectra of similarity matrices, constructed from categorical data, follow those from random matrix theory. We demonstrate this approach by applying it to the data of Indian general elections and sea level pressures in North Atlantic ocean. Comment: 6 pages, 7 figures Text North Atlantic ArXiv.org (Cornell University Library) Indian Physical Review E 92 3
institution Open Polar
collection ArXiv.org (Cornell University Library)
op_collection_id ftarxivpreprints
language unknown
topic Physics - Data Analysis
Statistics and Probability
spellingShingle Physics - Data Analysis
Statistics and Probability
Patil, Aashay
Santhanam, M. S.
Random matrix approach to multivariate categorical data analysis
topic_facet Physics - Data Analysis
Statistics and Probability
description Correlation and similarity measures are widely used in all the areas of sciences and social sciences. Often the variables are not numbers but are instead qualitative descriptors called categorical data. We define and study similarity matrix, as a measure of similarity, for the case of categorical data. This is of interest due to a deluge of categorical data, such as movie ratings, top-10 rankings and data from social media, in the public domain that require analysis. We show that the statistical properties of the spectra of similarity matrices, constructed from categorical data, follow those from random matrix theory. We demonstrate this approach by applying it to the data of Indian general elections and sea level pressures in North Atlantic ocean. Comment: 6 pages, 7 figures
format Text
author Patil, Aashay
Santhanam, M. S.
author_facet Patil, Aashay
Santhanam, M. S.
author_sort Patil, Aashay
title Random matrix approach to multivariate categorical data analysis
title_short Random matrix approach to multivariate categorical data analysis
title_full Random matrix approach to multivariate categorical data analysis
title_fullStr Random matrix approach to multivariate categorical data analysis
title_full_unstemmed Random matrix approach to multivariate categorical data analysis
title_sort random matrix approach to multivariate categorical data analysis
publishDate 2015
url http://arxiv.org/abs/1503.06559
https://doi.org/10.1103/PhysRevE.92.032130
geographic Indian
geographic_facet Indian
genre North Atlantic
genre_facet North Atlantic
op_relation http://arxiv.org/abs/1503.06559
Phys. Rev. E 92, 032130 (2015)
doi:10.1103/PhysRevE.92.032130
op_doi https://doi.org/10.1103/PhysRevE.92.032130
container_title Physical Review E
container_volume 92
container_issue 3
_version_ 1776202092322488320