| Version: | 0.2-3 | 
| Date: | 2021-07-19 | 
| Title: | Panel Generalized Linear Models | 
| Depends: | R (≥ 2.10), maxLik, plm | 
| Imports: | statmod, Formula | 
| Suggests: | lmtest, car | 
| Description: | Estimation of panel models for glm-like models: this includes binomial models (logit and probit), count models (poisson and negbin) and ordered models (logit and probit), as described in: Baltagi (2013) Econometric Analysis of Panel Data, ISBN-13:978-1-118-67232-7, Hsiao (2014) Analysis of Panel Data <doi:10.1017/CBO9781139839327> and Croissant and Millo (2018), Panel Data Econometrics with R, ISBN:978-1-118-94918-4. | 
| License: | GPL-2 | GPL-3 [expanded from: GPL (≥ 2)] | 
| URL: | https://cran.r-project.org/package=pglm | 
| NeedsCompilation: | no | 
| Packaged: | 2021-07-19 18:01:36 UTC; yves | 
| Author: | Yves Croissant [aut, cre] | 
| Maintainer: | Yves Croissant <yves.croissant@univ-reunion.fr> | 
| Repository: | CRAN | 
| Date/Publication: | 2021-07-19 18:10:02 UTC | 
Perveived Fairness of Rules for Allocating Seats in Trains and Parking Spaces
Description
observations of 401 individuals
number of observations : 5614
country : France
economic topic : public economics
econometrics topic : ordered response
Usage
data(Fairness)
Format
A dataframe containing :
- id
 the individual index
- answer
 a factor with levels 0 (very unfair), 1 (essentially unfair), 2 (essentially fair) and 3 (very fair)
- good
 one of
'tgv'(French fast train) and'Parking'- rule
 the allocation rule, a factor with levels
'peak','admin','lottery','addsupply','queuing','moral'and'compensation'- driving
 does the individual has the driving license ?
- education
 does the individual has a diploma ?
- recurring
 does the allocation problem is reccuring ?
Source
provided by the authors.
References
Charles Raux, Stephanie Souche and Yves Croissant (2009) “How fair is pricing perceived to be? An empirical study”, Public Choice, 139(1), 227-240.
Health Insurance and Doctor Visits
Description
observations of 401 individuals
number of observations : 20186
country : United States
economic topic : Health Economics
econometrics topic : censored dependant variable
Usage
data(HealthIns)
Format
A time serie containing :
- id
 the individual index
- year
 the year
- med
 medical expenses
- mdu
 number of face-to face medical visits
- coins
 coinsurance rate
- disease
 count of chronic diseases
- sex
 a factor with level
'male'and'female'- age
 the age
- size
 the size of the family
- child
 a factor with levels
'no'and'yes'
Source
Manning, W. G., J. P. Newhouse, N. Duan, E. B. Keeler and A. Leibowitz (1987) “Health Insurance and the Demand for Medical Care: Evidence from a Randomized Experiment”, American Economic Review, 77(3), 251-277.
Deeb P. , and P.K. Trivedi (2002) “The structure of demand for medical care: latent class versus two-part models”, Journal of Health Economics, 21, 601-625..
References
http://cameron.econ.ucdavis.edu/musbook/mus.html.
Patents, R\&d and Technological Spillovers for a Panel of Firms
Description
annual observations of 181 firms from 1983 to 1991
number of observations : 1629
country : world
economic topic : producer behavior
econometrics topic : count data
Usage
data(PatentsRD)
Format
A dataframe containing :
- firm
 firm's id
- year
 year
- sector
 firm's main industry sector, one of aero (aerospace), chem (chemistry), comput (computer), drugs, elec (electricity), food, fuel (fuel and mining), glass, instr (instruments), machin (machinery), metals, other, paper, soft (software), motor (motor vehicules)
- geo
 geographic area, one of eu (European Union), japan, usa, rotw (rest of the world)
- patent
 numbers of European patent applications
- rdexp
 log of R and D expenditures
- spil
 log of spillovers
Source
Cincer, Michele (1997) “Patents, R \& D and technological spillovers at the firm level : some evidence from econometric count models for panel data”, Journal of Applied Econometrics, 12(3), may–june, 265–280.
References
Journal of Applied Econometrics data archive : http://qed.econ.queensu.ca/jae/.
Dynamic Relation Between Patents and R\&d
Description
yearly observations of 346 production units
number of observations : 3460
country : United States
economic topic : industrial economics
econometrics topic : count data
Usage
data(PatentsRDUS)
Format
A dataframe containing :
- cusip
 compustat's identifying number for the firm
- year
 year
- ardssic
 a two-digit code for the applied R&D industrial classification
- scisect
 is the firm in the scientific sector ?
- capital72
 book value of capital in 1972
- sumpat
 the sum of patents applied for between 1972-1979
- rd
 R&D spending during the year (in 1972 dollars)
- patents
 the number of patents applied for during the year that were eventually granted
Source
Hall, Browyn, Zvi Griliches and Jerry Hausman (1986) “Patents and R and D: Is there a Lag?”, International Economic Review, 27, 265-283.
References
http://cameron.econ.ucdavis.edu/racd/racddata.html, chapter 9..
Unionism and Wage Rate Determination
Description
yearly observations of 545 individuals from 1980 to 1987
number of observations : 4360
country : United States
economic topic : labor economics
econometrics topic : endogeneity
Usage
data(UnionWage)
Format
A dataframe containing :
- id
 the individual index
- year
 the year
- exper
 the experience, computed as age - 6 - schooling
- health
 does the individual has health disability ?
- hours
 the number of hours worked
- married
 is the individual married ?
- rural
 does the individual lives in a rural area ?
- school
 years of schooling
- union
 does the wage is set by collective bargaining
- wage
 hourly wage in US dollars
- sector
 one of agricultural, mining, construction, trade, transportation, finance, businessrepair, personalservice, entertainment, manufacturing, pro.rel.service, pub.admin
- occ
 one of proftech, manoffpro, sales, clerical, craftfor, operative, laborfarm, farmlabor, service
- com
 one of black, hisp and other
- region
 the region, one of NorthEast, NothernCentral, South and other
Source
Journal of Applied Econometrics data archive : http://qed.econ.queensu.ca/jae/.
References
Vella, F. and M. Verbeek (1998) “Whose wages do unions raise ? A dynamic model of unionism and wage”, Journal of Applied Econometrics, 13, 163–183.
Panel Estimators for Generalized Linear Models
Description
Estimation by maximum likelihood of glm (binomial and Poisson) and 'glm-like' models (Negbin and ordered) on longitudinal data
Usage
pglm(formula, data, subset, na.action,
     effect = c("individual", "time", "twoways"),
     model = c("random", "pooling", "within", "between"),
     family, other = NULL, index = NULL, start = NULL, R = 20,  ...) 
Arguments
formula | 
 a symbolic description of the model to be estimated,  | 
data | 
 the data: a   | 
subset | 
 an optional vector specifying a subset of observations,  | 
na.action | 
 a function which indicates what should happen when
the data contains '  | 
effect | 
 the effects introduced in the model, one of
  | 
model | 
 one of   | 
family | 
 the distribution to be used,  | 
other | 
 for developper's use only,  | 
index | 
 the index,  | 
start | 
 a vector of starting values,  | 
R | 
 the number of function evaluation for the gaussian quadrature method used,  | 
... | 
 further arguments.  | 
Value
An object of class "pglm", a list with elements:
coefficients | 
 the named vector of coefficients,  | 
logLik | 
 the value of the log-likelihood,  | 
hessian | 
 the hessian of the log-likelihood at convergence,  | 
gradient | 
 the gradient of the log-likelihood at convergence,  | 
call | 
 the matched call,  | 
est.stat | 
 some information about the estimation (time used, optimisation method),  | 
freq | 
 the frequency of choice,  | 
residuals | 
 the residuals,  | 
fitted.values | 
 the fitted values,  | 
formula | 
 the formula (a   | 
expanded.formula | 
 the formula (a   | 
model | 
 the model frame used,  | 
index | 
 the index of the choice and of the alternatives.  | 
Author(s)
Yves Croissant
Examples
## an ordered probit example
data('Fairness', package = 'pglm')
Parking <- subset(Fairness, good == 'parking')
op <- pglm(as.numeric(answer) ~ education + rule,
           Parking[1:105, ],
           family = ordinal('probit'), R = 5, print.level = 3,
           method = 'bfgs', index = 'id',  model = "random")
## a binomial (probit) example
data('UnionWage', package = 'pglm')
anb <- pglm(union ~ wage + exper + rural, UnionWage, family = binomial('probit'),
            model = "pooling",  method = "bfgs", print.level = 3, R = 5)
## a gaussian example on unbalanced panel data
data(Hedonic, package = "plm")
ra <- pglm(mv ~ crim + zn + indus + nox + age + rm, Hedonic, family = gaussian,
           model = "random", print.level = 3, method = "nr", index = "townid")
## some count data models
data("PatentsRDUS", package="pglm")
la <- pglm(patents ~ lag(log(rd), 0:5) + scisect + log(capital72) + factor(year), PatentsRDUS,
           family = negbin, model = "within", print.level = 3, method = "nr",
           index = c('cusip', 'year'))
la <- pglm(patents ~ lag(log(rd), 0:5) + scisect + log(capital72) + factor(year), PatentsRDUS,
           family = poisson, model = "pooling", index = c("cusip", "year"),
           print.level = 0, method="nr")
## a tobit example
data("HealthIns", package="pglm")
HealthIns$med2 <- HealthIns$med / 1000
HealthIns2 <- HealthIns[-2209, ]
set.seed(2)
subs <- sample(1:20186, 200, replace = FALSE)
HealthIns2 <- HealthIns2[subs, ]
la <- pglm(med ~ mdu + disease + age, HealthIns2,
           model = 'random', family = 'tobit', print.level = 0,
           method = 'nr', R = 5)