Abstract
With the advance of technology, the collection and storage of data have become routine. Huge amounts of data are increasingly produced from biology, meteorology, psychology, chemistry, and economics experiments. As technology progresses, these high-dimension problems are becoming more and more common. The "large p, small n" problem, in which there are more variables than samples, is currently a challenge that many statisticians face especially when it becomes multiresponse. Many researchers have resorted to using the sparse regression of the response variable in the multivariate case. A sparse matrix is defined as a matrix with the majority of its members equal to zero. A sparse matrix's zero entries reduced the number of parameters that may be interpreted. In this paper, we focus on the comparison between the four method: SiER, SRRR, remMap, SPLS in the selection of variables that affect the hormones that cause for the data of a group of patients with thyroid disorders. Software implementing the method is publicly available in the R package sparse-reg..