出版时间:2009-6 出版社:王济川、谢海义、 费舍余 (Fisher.J.) 高等教育出版社 (2009-06出版) 作者:王济川,谢海义,(美) 费舍余 著 页数:264
Tag标签:无
前言
Interest in multilevel statistical models for social science and public health studies has been aroused dramatically since the mid-1980s. New multilevel modeling techniques are giving researchers tools for analyzing data that have a hierarchical or clustered structure. Multilevel models are now applied to a wide range of studies in sociology, population studies, education studies, psychology, economics, epidemiology, and public health.Individuals and social contexts (e.g., communities, schools, organizations, or geographic locations) to which individuals belong are conceptualized as a hierarchical system, in which individuals are micro units and contexts are macro units. Research interest often centers on whether and how individual outcome varies across contexts, and how the variation is explained by contextual factors; what and how the relationships between the outcome measures and individual characteristics vary across contexts, and how the relationships are influenced or moderated by contextual factors. To address these questions, studies often employ data collected from more than one level of observation units, i.e., observations are collected at both an individual level (e.g., students) and one or more contextual levels (e.g., schools, cities). As a result, the data are characterized by a hierarchical structure in which individuals are nested within units at the higher levels. This kind of data is called hierarchically structured data or multilevel data. The conventional single-level statistical methods, such as ordinary least square(OLS) regression are inappropriate for analysis of multilevel data because observations are nonindependent and the contextual effects cannot be addressed appropriately in such models. Multilevel modeling not only takes into account observation dependence in the multilevel data, but also provides a more meaningful conceptual framework by allowing assessment of both individual and contextual effects, as well as cross-level interaction effects.This book covers a broad range of topics about multilevel modeling. Our goal is to help students and researchers who are interested in analysis of multilevel data to understand the basic concepts, theoretical frameworks and application methods of multilevel modeling. This book is written in non-mathematical terms, focusing on the methods and application of various multilevel models, using the internationally widely used statistical software, the Statistics Analysis System (SAS). Examples are drawn from analysis of real-world research data. We focus on twolevel models in this book because it is most frequently encountered situation in real research. These models can be readily expanded to models with three or more levels when applicable. A wide range of linear and non-linear multilevel models are introduced and demonstrated.
内容概要
Multilevel Models: Appfications Using SAS is written in nontechnical terms, focuses on the methods and applications of various multilevel models, including liner multilevel models,multilevel logistic regression models, multilevel Poisson regression models, multilevel negative binomial models, as well as some cutting-edge applications, such as multilevel zero-inflated Poisson (ZIP) model, random effect zero-inflated negative binomial model (RE-ZINB), mixed-effect mixed-distribution models, bootstrapping multilevel models, and group-based trajectory models. Readers will learn to build and apply multilevel models for hierarchically structured cross-sectional data and longitudinal data using the internationally distributed software package Statistics Analysis System (SAS). Detailed SAS syntax and output are provided for model applications, providing students, research scientists and data analysts with ready templates for their applications.
作者简介
作者:王济川 谢海义 (美国) 费舍余 (Fisher.J.) Dr. Jichuan Wang is a professor in the Center for Clinical and Community Research, the Children's National Medical Center, the George Washington Universty School of Medicine.Dr. Haiyi Xie is an associate professor of Community and Family Medicine, Dartmouth Medical School, Dartmouth College.Dr. James Henry Fisher is a senior planner for the HancockCounty Planning Commission in EIIsworth, Maine.
书籍目录
Chapter 1 Introduction1.1 Conceptual framework of multilevel modeling1.2 Hierarchically structured data1.3 Variables in multilevel data1.4 Analytical problems with multilevel data1.5 Advantages and limitations of multilevel modeling1.6 Computer software for multilevel modelingChapter 2 Basics of Linear Multilevel Models2.1 Intraclass correlation coefficient (ICC)2.2 Formulation of two-level multilevel models2.3 Model assumptions2.4 Fixed and random regression coefficients2.5 Cross-level interactions2.6 Measurement centering2.7 Model estimation2.8 Model fit, hypothesis testing, and model comparisons2.8.1 Model fit2.8.2 Hypothesis testing2.8.3 Model comparisons2.9 Explained level-1 and level-2 variances2.10 Steps for building multilevel models2.11 Higher-level multilevel modelsChapter 3 Application of Two-level Linear Multilevel Models3.1 Data3.2 Empty model3.3 Predicting between-group variation3.4 Predicting within-group variation3.5 Testing random level-1 slopes3.6 Across-level interactions3.7 Other issues in model developmentChapter 4 Application of Multilevel Modeling to Longitudinal Data4.1 Features of longitudinal data4.2 Limitations of traditional approaches for modeling longitudinal data4.3 Advantages of multilevel modeling for longitudinal data4.4 Formulation of growth models4.5 Data description and manipulation4.6 Linear growth models4.6.1 The shape of average outcome change over time4.6.2 Random intercept growth models4.6.3 Random intercept and slope growth models4.6.4 Intercept and slope as outcomes4.6.5 Controlling for individual background variables in models4.6.6 Coding time score4.6.7 Residual variance/covariance structures4.6.8 Time-varying covariates4.7 Curvilinear growth models4.7.1 Polynomial growth model4.7.2 Dealing with collinearity in higher order polynomial growth model4.7.3 Piecewise (linear spline) growth modelChapter 5 Multilevel Models for Discrete Outcome Measures5.1 Introduction to generalized linear mixed models5.1.1 Generalized linear models5.1.2 Generalized linear mixed models5.2 SAS Procedures for multilevel modeling with discrete outcomes5.3 Multilevel models for binary outcomes5.3.1 Logistic regression models5.3.2 Probit models5.3.3 Unobserved latent variables and observed binary outcome measures5.3.4 Multilevel logistic regression models5.3.5 Application of multilevel logistic regression models5.3.6 Application of multilevel logit models to longitudinal data5.4 Multilevel models for ordinal outcomes5.4.1 Cumulative logit models5.4.2 Multilevel cumulative logit models5.5 Multilevel models for nominal outcomes5.5.1 Multinomial logit models5.5.2 Multilevel multinomial logit models5.5.3 Application of multilevel multinomial logit models5.6 Multilevel models for count outcomes5.6.1 Poisson regression models5.6.2 Poisson regression with over-dispersion and a negative binomial model5.6.3 Multilevel Poisson and negative binomial models5.6.4 Application of multilevel Poisson and negative binomial modelsChapter 6 Other Applications of Multilevel Modeling and Related Issues6.1 Multilevel zero-inflated models for count data with extra zeros6.1.1 Fixed-effect ZIP model6.1.2 Random effect zero-inflated Poisson (RE-ZIP) models6.1.3 Random effect zero-inflated negative binomial (RE-ZINB) models6.1.4 Application of RE-ZIP and RE-ZINB models6.2 Mixed-effect mixed-distribution models for semi-continuous outcomes6.2.1 Mixed-effects mixed distribution model6.2.2 Application of the Mixed-Effect mixed distribution model6.3 Bootstrap multilevel modeling6.3.1 Nonparametric residual bootstrap multilevel modeling6.3.2 Parametric residual bootstrap multilevel modeling6.3.3 Application of nonparametric residual bootstrap multilevel modeling6.4 Group-based models for longitudinal data analysis6.4.1 Introduction to group-based model6.4.2 Group-based logit model6.4.3 Group-based zero-inflated Poisson (ZIP) model6.4.4 Group-based censored normal models6.5 Missing values issue6.5.1 Missing data mechanisms and their implications6.5.2 Handling missing data in longitudinal data analyses6.6 Statistical power and sample size for multilevel modeling6.6.1 Sample size estimation for two-level designs6.6.2 Sample size estimation for longitudinal data analysisReferenceIndex
章节摘录
插图:In the linear model case, this integral can be solved in closed form, and the resulting likelihood or restricted likelihood can be maximized directly. For nonlinear multilevel models, however, the integral is usually unknown and must be approximated. Many methods have been proposed for such maximization approximation. Two basic methods are: 1) linearizati'on, which approximates the integrated likelihood function using techniques such as Taylor series expansion, 2) integral approximation with numerical methods. These approaches are implemented in two SAS procedures, PROC GLIMMIX and PROC NLMIXED and two macros, %GLIMMIX and %NLMIXED, respectively.Prior to the current version of SAS (SAS 9.2) (SAS Institute Inc., 2008), PROC GLIMMIX is solely based on linearization methods. In version 9.2 of PROC GLIMMIX, linearization is the default estimation method, and two numerical integration methods——Laplace approximation method and adaptive Gauss-Hermite quadrature have been added as options. The linearization method is also called a pseudo-likelihood method, in which pseudo-data are generated from the original data, and likelihood function is approximated using Taylor series expansions (Schabenberger, 2005). The essential idea of the linearization method is to approximate GLMM using normal linear mixed model estimates repeatedly. Among the various linearization methods available in the procedure, the default method is the restricted or residual pseudo-likelihood (REPL) (Wolfinger & O'Connell, 1993). The maximization of the pseudo-likelihood can be carried out by various optimization techniques in PROC GLIMMIX. The default optimization technique is the Newton-Raphson algorithm.The major advantages of linearization-based methods include: First, they can fit models for which the joint distribution is difficult or impossible to ascertain. Second, compared with numerical integration methods, they allow a larger number of random effects to be estimated in the model. Third, the variance/covariance structure of the level-1 residual matrix (i.e., R matrix) can be readily accommodated. Fourth, the model is iteratively estimated based on the linear mixed model, thus both ML and REML are available for model estimation (Schabenberger, 2005). In addition, in our experience, linearization based models are much faster to run.The disadvantages of linearization-based methods include: First, they are based on iterative model estimation using pseudo-data constructed from the original data; as such, they do not have a real likelihood, and therefore -2LL or deviance statistic cannot be used for model comparisons. Second, PROC GLIMMIX does not support a broad array of variance/covariance structures of the R matrix that you can draw on with the PROC MIXED procedure (Schabenberger, 2005).
编辑推荐
《多层统计分析模型:SAS与应用》是由高等教育出版社出版的。
图书封面
图书标签Tags
无
评论、评分、阅读与下载