Singular Value Decomposition and factor analysis in Social Science and Humanities

Authors

  • Sergio A. Pernice, Universidad del CEMA

DOI:

https://doi.org/10.46661/rev.metodoscuant.econ.empresa.8004

Keywords:

Singular Value Decomposition, Factor Analysis, Humanities and Social Sciences

Abstract

The objects of study of the humanities and social sciences are intrinsically complex. Because it is philosophically attractive, and because it helps in practice to manage such complexity, one of the most influential central ideas throughout the history and present of these disciplines is the notion that the large number of empirical manifestations that characterize their objects of study are actually expressions of a few factors that influence all other variables. The corresponding statistical methodology to implement these ideas has different names and differs in detail in different disciplines, but one name that can be recognized in many of them is “factor analysis”. The first objective of this work is to present a classical method of linear algebra, known as “Singular Value Decomposition” (SVD), in an intuitive and at the same time rigorous way to the community of human and social sciences. SVD systematizes and generalizes the factorization of any data matrix. In addition, the method is of enormous importance in the era of big data and machine learning, which are increasingly influencing research in all areas of study. The second objective is to invite questioning of certain hypotheses in traditional factor analysis. The SVD reveals that factors are inherent in any matrix-structured data set; what is crucial is how singular values decay. Data will determine this decay, with potentially profoundly transformative theoretical repercussions.
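
As a rough illustration of the idea that any data matrix can be factorized and that the decay of its singular values is what matters, the following sketch applies NumPy's SVD to a synthetic matrix. The data, the dimensions, the noise level, and all variable names are assumptions made for this example only and are not taken from the article.

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed so the sketch is reproducible

# Hypothetical data set: 200 observations of 30 variables,
# generated from 3 latent factors plus a small amount of noise.
n_obs, n_vars, n_factors = 200, 30, 3
scores = rng.normal(size=(n_obs, n_factors))     # factor scores per observation
loadings = rng.normal(size=(n_factors, n_vars))  # factor loadings per variable
noise = 0.1 * rng.normal(size=(n_obs, n_vars))
X = scores @ loadings + noise

# Center each column, then factorize: Xc = U @ diag(s) @ Vt.
Xc = X - X.mean(axis=0)
U, s, Vt = np.linalg.svd(Xc, full_matrices=False)

# Share of the total sum of squares carried by each singular value.
explained = s**2 / np.sum(s**2)

# Rank-k truncation: keep only the k largest singular values.
k = n_factors
X_k = (U[:, :k] * s[:k]) @ Vt[:k, :]

# Relative reconstruction error in the Frobenius norm.
rel_error = np.linalg.norm(Xc - X_k) / np.linalg.norm(Xc)

print("largest singular values:", np.round(s[:6], 2))
print("cumulative share, first k:", round(float(explained[:k].sum()), 4))
print("relative error of rank-k approximation:", round(float(rel_error), 4))
```

In this constructed case the singular values drop sharply after the third one because the data were built that way. With real data the decay may be far more gradual, and the shape of that decay is the empirical question the abstract points to.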

Published

2024-03-01

How to Cite

Pernice, S. A. (2024). Singular Value Decomposition and factor analysis in Social Science and Humanities. Journal of Quantitative Methods for Economics and Business Administration, 37, 1–29. https://doi.org/10.46661/rev.metodoscuant.econ.empresa.8004

Section

Articles