| Version: | 0.2-3 | 
| Date: | 2021-07-19 | 
| Title: | Panel Generalized Linear Models | 
| Depends: | R (≥ 2.10), maxLik, plm | 
| Imports: | statmod, Formula | 
| Suggests: | lmtest, car | 
| Description: | Estimation of panel models for glm-like models: this includes binomial models (logit and probit), count models (poisson and negbin) and ordered models (logit and probit), as described in: Baltagi (2013) Econometric Analysis of Panel Data, ISBN-13:978-1-118-67232-7, Hsiao (2014) Analysis of Panel Data <doi:10.1017/CBO9781139839327> and Croissant and Millo (2018), Panel Data Econometrics with R, ISBN:978-1-118-94918-4. | 
| License: | GPL-2 | GPL-3 [expanded from: GPL (≥ 2)] | 
| URL: | https://cran.r-project.org/package=pglm | 
| NeedsCompilation: | no | 
| Packaged: | 2021-07-19 18:01:36 UTC; yves | 
| Author: | Yves Croissant [aut, cre] | 
| Maintainer: | Yves Croissant <yves.croissant@univ-reunion.fr> | 
| Repository: | CRAN | 
| Date/Publication: | 2021-07-19 18:10:02 UTC | 
Perveived Fairness of Rules for Allocating Seats in Trains and Parking Spaces
Description
observations of 401 individuals
number of observations : 5614
country : France
economic topic : public economics
econometrics topic : ordered response
Usage
data(Fairness)Format
A dataframe containing :
- id
- the individual index 
- answer
- a factor with levels 0 (very unfair), 1 (essentially unfair), 2 (essentially fair) and 3 (very fair) 
- good
- one of - 'tgv'(French fast train) and- 'Parking'
- rule
- the allocation rule, a factor with levels - 'peak',- 'admin',- 'lottery',- 'addsupply',- 'queuing',- 'moral'and- 'compensation'
- driving
- does the individual has the driving license ? 
- education
- does the individual has a diploma ? 
- recurring
- does the allocation problem is reccuring ? 
Source
provided by the authors.
References
Charles Raux, Stephanie Souche and Yves Croissant (2009) “How fair is pricing perceived to be? An empirical study”, Public Choice, 139(1), 227-240.
Health Insurance and Doctor Visits
Description
observations of 401 individuals
number of observations : 20186
country : United States
economic topic : Health Economics
econometrics topic : censored dependant variable
Usage
data(HealthIns)Format
A time serie containing :
- id
- the individual index 
- year
- the year 
- med
- medical expenses 
- mdu
- number of face-to face medical visits 
- coins
- coinsurance rate 
- disease
- count of chronic diseases 
- sex
- a factor with level - 'male'and- 'female'
- age
- the age 
- size
- the size of the family 
- child
- a factor with levels - 'no'and- 'yes'
Source
Manning, W. G., J. P. Newhouse, N. Duan, E. B. Keeler and A. Leibowitz (1987) “Health Insurance and the Demand for Medical Care: Evidence from a Randomized Experiment”, American Economic Review, 77(3), 251-277.
Deeb P. , and P.K. Trivedi (2002) “The structure of demand for medical care: latent class versus two-part models”, Journal of Health Economics, 21, 601-625..
References
http://cameron.econ.ucdavis.edu/musbook/mus.html.
Patents, R\&d and Technological Spillovers for a Panel of Firms
Description
annual observations of 181 firms from 1983 to 1991
number of observations : 1629
country : world
economic topic : producer behavior
econometrics topic : count data
Usage
data(PatentsRD)Format
A dataframe containing :
- firm
- firm's id 
- year
- year 
- sector
- firm's main industry sector, one of aero (aerospace), chem (chemistry), comput (computer), drugs, elec (electricity), food, fuel (fuel and mining), glass, instr (instruments), machin (machinery), metals, other, paper, soft (software), motor (motor vehicules) 
- geo
- geographic area, one of eu (European Union), japan, usa, rotw (rest of the world) 
- patent
- numbers of European patent applications 
- rdexp
- log of R and D expenditures 
- spil
- log of spillovers 
Source
Cincer, Michele (1997) “Patents, R \& D and technological spillovers at the firm level : some evidence from econometric count models for panel data”, Journal of Applied Econometrics, 12(3), may–june, 265–280.
References
Journal of Applied Econometrics data archive : http://qed.econ.queensu.ca/jae/.
Dynamic Relation Between Patents and R\&d
Description
yearly observations of 346 production units
number of observations : 3460
country : United States
economic topic : industrial economics
econometrics topic : count data
Usage
data(PatentsRDUS)Format
A dataframe containing :
- cusip
- compustat's identifying number for the firm 
- year
- year 
- ardssic
- a two-digit code for the applied R&D industrial classification 
- scisect
- is the firm in the scientific sector ? 
- capital72
- book value of capital in 1972 
- sumpat
- the sum of patents applied for between 1972-1979 
- rd
- R&D spending during the year (in 1972 dollars) 
- patents
- the number of patents applied for during the year that were eventually granted 
Source
Hall, Browyn, Zvi Griliches and Jerry Hausman (1986) “Patents and R and D: Is there a Lag?”, International Economic Review, 27, 265-283.
References
http://cameron.econ.ucdavis.edu/racd/racddata.html, chapter 9..
Unionism and Wage Rate Determination
Description
yearly observations of 545 individuals from 1980 to 1987
number of observations : 4360
country : United States
economic topic : labor economics
econometrics topic : endogeneity
Usage
data(UnionWage)Format
A dataframe containing :
- id
- the individual index 
- year
- the year 
- exper
- the experience, computed as age - 6 - schooling 
- health
- does the individual has health disability ? 
- hours
- the number of hours worked 
- married
- is the individual married ? 
- rural
- does the individual lives in a rural area ? 
- school
- years of schooling 
- union
- does the wage is set by collective bargaining 
- wage
- hourly wage in US dollars 
- sector
- one of agricultural, mining, construction, trade, transportation, finance, businessrepair, personalservice, entertainment, manufacturing, pro.rel.service, pub.admin 
- occ
- one of proftech, manoffpro, sales, clerical, craftfor, operative, laborfarm, farmlabor, service 
- com
- one of black, hisp and other 
- region
- the region, one of NorthEast, NothernCentral, South and other 
Source
Journal of Applied Econometrics data archive : http://qed.econ.queensu.ca/jae/.
References
Vella, F. and M. Verbeek (1998) “Whose wages do unions raise ? A dynamic model of unionism and wage”, Journal of Applied Econometrics, 13, 163–183.
Panel Estimators for Generalized Linear Models
Description
Estimation by maximum likelihood of glm (binomial and Poisson) and 'glm-like' models (Negbin and ordered) on longitudinal data
Usage
pglm(formula, data, subset, na.action,
     effect = c("individual", "time", "twoways"),
     model = c("random", "pooling", "within", "between"),
     family, other = NULL, index = NULL, start = NULL, R = 20,  ...) 
Arguments
| formula | a symbolic description of the model to be estimated, | 
| data | the data: a  | 
| subset | an optional vector specifying a subset of observations, | 
| na.action | a function which indicates what should happen when
the data contains ' | 
| effect | the effects introduced in the model, one of
 | 
| model | one of  | 
| family | the distribution to be used, | 
| other | for developper's use only, | 
| index | the index, | 
| start | a vector of starting values, | 
| R | the number of function evaluation for the gaussian quadrature method used, | 
| ... | further arguments. | 
Value
An object of class "pglm", a list with elements:
| coefficients | the named vector of coefficients, | 
| logLik | the value of the log-likelihood, | 
| hessian | the hessian of the log-likelihood at convergence, | 
| gradient | the gradient of the log-likelihood at convergence, | 
| call | the matched call, | 
| est.stat | some information about the estimation (time used, optimisation method), | 
| freq | the frequency of choice, | 
| residuals | the residuals, | 
| fitted.values | the fitted values, | 
| formula | the formula (a  | 
| expanded.formula | the formula (a  | 
| model | the model frame used, | 
| index | the index of the choice and of the alternatives. | 
Author(s)
Yves Croissant
Examples
## an ordered probit example
data('Fairness', package = 'pglm')
Parking <- subset(Fairness, good == 'parking')
op <- pglm(as.numeric(answer) ~ education + rule,
           Parking[1:105, ],
           family = ordinal('probit'), R = 5, print.level = 3,
           method = 'bfgs', index = 'id',  model = "random")
## a binomial (probit) example
data('UnionWage', package = 'pglm')
anb <- pglm(union ~ wage + exper + rural, UnionWage, family = binomial('probit'),
            model = "pooling",  method = "bfgs", print.level = 3, R = 5)
## a gaussian example on unbalanced panel data
data(Hedonic, package = "plm")
ra <- pglm(mv ~ crim + zn + indus + nox + age + rm, Hedonic, family = gaussian,
           model = "random", print.level = 3, method = "nr", index = "townid")
## some count data models
data("PatentsRDUS", package="pglm")
la <- pglm(patents ~ lag(log(rd), 0:5) + scisect + log(capital72) + factor(year), PatentsRDUS,
           family = negbin, model = "within", print.level = 3, method = "nr",
           index = c('cusip', 'year'))
la <- pglm(patents ~ lag(log(rd), 0:5) + scisect + log(capital72) + factor(year), PatentsRDUS,
           family = poisson, model = "pooling", index = c("cusip", "year"),
           print.level = 0, method="nr")
## a tobit example
data("HealthIns", package="pglm")
HealthIns$med2 <- HealthIns$med / 1000
HealthIns2 <- HealthIns[-2209, ]
set.seed(2)
subs <- sample(1:20186, 200, replace = FALSE)
HealthIns2 <- HealthIns2[subs, ]
la <- pglm(med ~ mdu + disease + age, HealthIns2,
           model = 'random', family = 'tobit', print.level = 0,
           method = 'nr', R = 5)