We provide a codebook for the EOIR Case dataset. The codebook is a work in progress; there are many things we do not understand in the data, and some of our educated guesses here may be mistaken. In many cases, we draw directly on EOIR’s code key, but we recommend that users cross-reference this codebook against that code key. We will continue to update the codebook as we learn more, and we welcome feedback and corrections. We do not currently document the fields in the motions or pro bono tables.

Data structure

The EOIR CASE dataset includes multiple tables that are linked together through a series of unique identifiers. Four points are important to understand:

The government provides each respondent a unique A-number.
An individual respondent may have multiple cases in the EOIR database. Each case type (e.g., credible fear or removal) requires a different case id number (idncase).
Each case may include multiple proceedings that require unique proceeding id numbers (idnproceeding).
During each proceeding an immigration judge may schedule multiple hearings (idnschedule), during which she administers the proceeding. Judges adjourn hearings for a defined set of reasons, each of which has its own code.
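
The four points above describe a nested hierarchy: respondent, then cases, then proceedings, then hearings. A minimal sketch of how the identifiers might be used to walk that hierarchy follows. The rows are invented toy data, and the field names a_number, case_type, and adj_code are hypothetical placeholders; only idncase, idnproceeding, and idnschedule come from the description above. The actual table layouts may differ.

```python
from collections import defaultdict

# Toy rows; every value here is invented for illustration.
# "a_number", "case_type", and "adj_code" are hypothetical field names.
cases = [
    {"idncase": 1, "a_number": "A001", "case_type": "credible fear"},
    {"idncase": 2, "a_number": "A001", "case_type": "removal"},
]
proceedings = [
    {"idnproceeding": 10, "idncase": 2},
    {"idnproceeding": 11, "idncase": 2},
]
schedule = [
    {"idnschedule": 100, "idnproceeding": 10, "adj_code": "01"},
    {"idnschedule": 101, "idnproceeding": 10, "adj_code": "02"},
    {"idnschedule": 102, "idnproceeding": 11, "adj_code": None},
]


def group_by(rows, key):
    """Index a list of row dicts by the value of one field."""
    index = defaultdict(list)
    for row in rows:
        index[row[key]].append(row)
    return index


cases_by_respondent = group_by(cases, "a_number")
proceedings_by_case = group_by(proceedings, "idncase")
hearings_by_proceeding = group_by(schedule, "idnproceeding")


def hearings_for_respondent(a_number):
    """Walk respondent -> cases -> proceedings -> hearings."""
    found = []
    for case in cases_by_respondent.get(a_number, []):
        for proc in proceedings_by_case.get(case["idncase"], []):
            for hearing in hearings_by_proceeding.get(proc["idnproceeding"], []):
                found.append(hearing["idnschedule"])
    return found
```

Calling hearings_for_respondent("A001") on the toy rows collects the hearing ids from both proceedings of the removal case; the credible fear case contributes none because it has no proceedings in this example.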

Tables

Fields (variables)

We describe the fields (a.k.a. variables or columns) below. We include the name of each field, a description, and the type of data in the field (e.g., string, numeric, date). Expanding a row will show an indicator for whether the field is available in each table and the proportion missing.