Contributed Poster Presentations: Isolated Statisticians

Shirin Golchi Chair
McGill University
 
Wednesday, Aug 6: 10:30 AM - 12:20 PM
4164 
Contributed Posters 
Music City Center 
Room: CC-Hall B 

Main Sponsor

Isolated Statisticians

Presentations

15: Efficient Designs for Multivariate Crossover Trials

This article aims to study efficient/trace optimal designs for crossover trials with multiple responses recorded from each subject in the time periods. A multivariate fixed effects model is proposed with direct and carryover effects corresponding to the multiple responses. The corresponding error dispersion matrix is chosen to be either proportional or the generalized Markov covariance type, permitting the existence of direct and cross-correlations within and between the multiple responses. The corresponding information matrices for direct effects under the two types of dispersions are used to determine efficient designs. The efficiency of orthogonal array designs of Type I and strength 2 is investigated for a wide choice of covariance functions, namely, Mat(0.5), Mat(1.5) and Mat(∞). To motivate these multivariate crossover designs, a gene expression dataset in a 3×3 framework is utilized. 

Keywords

Completely symmetric

Correlated response

Markov-type covariance

Orthogonal arrays

Proportional covariance

Trace optimal 

Co-Author

Siuli Mukhopadhyay, Indian Institute of Technology Bombay

First Author

Shubham Sanjay Niphadkar

Presenting Author

Shubham Sanjay Niphadkar

16: Group-Based Trajectory Modeling of Serum Potassium Levels in Terminal Phase Gastrointestinal Cancer Patients

In recent years, electronic medical record data has been increasingly utilized, and there is a growing need for trajectory and classification of electrolyte concentration data obtained from patients' routine blood drawings.
In this study, Group-Based Trajectory Modeling (GBTM) was applied to medical record data of patient in the terminal phase of the disease in Japan.
It is important to predict the prognosis and sudden change of condition of terminally ill patients, and the classification of trajectories and their eventual inclusion in the medical record will be useful for patient prediction.
The study utilized blood collection data from 4,981 terminally ill patients admitted to a Japanese university hospital between 2008 and 2016.
Focusing on potassium levels, which are known to decrease significantly at the end of life, GBTM was applied. The five groups that best fit the BIC were: very low [1] (7.2%), low [2] (32.5%), normal [3] (15.6%), high [4] (32.9%), and very high [5] (11.8%).
 

Keywords

Utilization of Electronic Medical Records

Group-Based Multivariate Trajectory Modeling

end-stage potassium 

Co-Author

Ayano Takeuchi, Keio University

First Author

Zixuan Bai

Presenting Author

Zixuan Bai

17: Limiting Spectral Distribution of Large Kendall's Correlation Matrix and its Application

The limiting spectral distribution (LSD) of high-dimensional Kendall's correlation matrix and tests of independence based on this matrix have been studied in the literature when the observations are absolutely continuous with respect to Lebesgue measure, and are independent and identically distributed. Here the investigation is focused on the LSD of Kendall's correlation matrix under much weaker assumptions which accommodate discrete and/or non-identical distributions, and also identify the limit distribution by using free probability. A graphical test of independence was also proposed under these assumptions. 

Keywords

Limiting Spectral Distribution

, free cumulant, Hoeffding's decomposition

Hoeffding's decomposition, Marcenko Pastur Law

Marcenko Pastur Law

non-crossing partition

random matrix 

First Author

Raunak Shevade

Presenting Author

Raunak Shevade

18: Statistics and Machine Learning in the Analysis of Undergraduate Programs at a Brazilian University:

The decrease in demand and dropout rates in undergraduate programs in exact and technological sciences are problems that have implications for society, affecting the job market and potentially leading to a shortage of professionals. Statistical and machine learning methods can contribute to a better understanding of this phenomenon and assist in the design of actions at universities. The objective of this work is to outline the student profile and identify factors for dropout, completion, and retention in undergraduate programs. The research was conducted on a technological sciences campus, including students in engineering, computing, and mathematics teaching. Data from the academic system were used. Exploratory analysis and data visualization techniques were applied to construct the profile. Logistic regression and tree-based models were used to identify the success factors in program completion. The partial results indicate that the student profile varies according to the program. In mathematics and engineering programs, the dropout rate is high and first-phase courses have a strong impact on student dropout,indicating the need for specific actions aimed at incoming students. 

Keywords

Data visualization

Logistic regression

Higher education

Student profile

Student success 

Co-Author

Elisa Henning, Universidade Do Estado De Santa Catarina

First Author

Ivanete Zuchi Siple, Universidade do Estado de Santa Catarina

Presenting Author

Ivanete Zuchi Siple, Universidade do Estado de Santa Catarina

19: Two Sample Test for High-dimensional Means Under Missing Observations

Statistical test for high-dimensional means under missing observations seems to be very rare in the literature. We propose a new two-sample test for high-dimensional means based on independent observations with missing values. The critical region of the proposed test is based on a bootstrap estimate of the sample quantiles of the proposed test statistic. Unlike the existing tests, this test does not require any distributional assumptions or any particular correlation structure of the covariance matrices. We establish the Gaussian approximation result for the proposed test statistic which is a non-trivial extension of the two-sample Gaussian approximation result with no missing values. The rate of accuracy of the bootstrap approximation of the sample quantile of the proposed test statistic is also derived. This Gaussian approximation result and the accuracy of the bootstrap estimators together provide the theoretical guarantees on the size and power of the proposed test. 

Keywords

high-dimensional central limit theorem

Kolmogorov distance

multiplier bootstrap

power function 

First Author

Shiv Kumar Yadav

Presenting Author

Shiv Kumar Yadav