Influence analysis of the missing observations model

Mathematical Computation June 2013, Volume 2, Issue 2, PP.19-23

Influence Analysis of the Missing Observations Model Baoguang Tian, Chunyan Liang College of Mathematics & Physics, Qingdao University of Science and Technology, Qingdao, 266041, China #Email: tianbaoguangqd@163.com

Abstract The influence of the deletion of data on the observations estimator has been studied according to the fitted value and estimate efficiency point of view in the several data missing model. The inequality relation between W-K statistics and generalized correlation coefficient is built and the equality relation between ratio of variance and zhang’s generalized correlation coefficient is obtained. In addition, the relationship between the ratio of generalized variance and the generalized coefficient of correlation has been discovered defined by Hoetelling. Keywords: Missing Observation Linear Model; W-K Statistics; Ratio of Variance; Generalized Correlation of Coefficient; Estimate of Missing Observations Ratio of Generalized Variance

1 INTRODUCTION Study of experimental data on the influence of the linear model is a problem which has applied value and theoretical value. In article [1-5], Cook and Weisberg as well as many authors have discussed the influence of experimental data on the linear regression model. In this paper, the influence of data on missing observations model has been discussed based on the fitted value and the estimate efficiency of view, and then W-K statistics, ratio variance and ratio of generalized variance were established to discover the relation between them and generalized correlation of coefficient. The general linear model with several missing observations is summarized by Y1  X1  e1 Y2  X 2   e2

Y3  X 3   e3

(1) (2) (3)

where

 ( X 2 )   ( X1)  ( X 3 )  ( X1)  {0}

(4) (5) And Yi is an ni  1 observable responses, X i is an ni  p matrix of known constants, ei is an ni  1 vector of unobservable errors,  is a p  1 vector of unknown parameters. E (ei )  0 , cov(ei , e j )   2ij I ni , i, j  1, 2,3 ,  ij are Kronecker symbols,  ( A) stands for the linear space which is generated by the column vectors of matrix A , Y2 and Y3 are missing observations. In this paper, it is supposed that the rank 0f X 2 is n2 .

Lemma 1 let A , B , U , V be appropriate matrixes and AAUBV  UBV . Then ( A  UBV )  AUB( B  BVAUB) BVA

(6)

where A is the generalized inverse matrix of A . The proof comes from article [6]. The article [7] proved that the estimate Yˆ2 of missing observations Y2 and Yˆ2 satisfied the following equation

Yˆ2  X 2 ( X1X1 ) X 1Y1  X 2 ˆ - 19 www.ivypub.org/mc

(7)

Turn static files into dynamic content formats.

Create a flipbook