ICE data documentation

Codebook

Description of the tables and fields (variables) in the ICE data on administrative arrests, detainers, detentions, encounters, and removals.

Read

Data processing

Steps and code used to create processed datasets for the most recent ICE release.

Read

Documents

Agency documentation of ICE datasets.

Explore

Webinar: Introduction to the ICE data

The webinar was held on December 3, 2025.

Frequently asked questions about the ICE data

What ICE data has the Deportation Data Project posted?

Most of the data we post is original data that we received from ICE. We have posted original individual-level data on: arrests, also called apprehensions; detainers, which are requests that ICE makes to local jails and prisons to hold individuals for transfer to ICE custody; detentions, which are records of individuals held in ICE custody; encounters, which are records of individuals who ICE either encountered in person or searched for electronically; and removals, which are records of deportations conducted by ICE.

In addition to these original datasets, which we have posted without alterations, we are working to post processed, easier-to-use versions of the data. First, we have posted close-to-original versions of the arrests, removals and detainers datasets, altered only to flag possible duplicates and make dates easier to use. Second, we have posted a simplified version of the ICE detentions data that includes just one row for each person’s stay in detention; details here.

What is individual-level data?

We seek and post data that is at the individual level. That means the datasets contain a row for each individual immigration enforcement action (such as arrest or removal). This allows the user to conduct analysis–for example, to count the number of arrests that occurred in a given area of responsibility. We do not conduct analysis ourselves (although we hope to have the capacity to do so in the future). That means that we do not provide counts, reports, or summary statistics. Instead, if you see counts or summary statistics attributed to the Deportation Data Project, that means someone conducted analysis of the data on our website, either by downloading the full dataset or by using our filtering dataset to download a part of the data. We are also working to post simplified versions of the data; the first of these is an ICE detentions dataset with just one row for each individual’s stay in detention (rather than a separate row for each book-in to a different detention center).

Where does the ICE data come from?

The data comes from ICE itself in response to FOIA requests. In most cases, we provide the raw data ICE provided without modification. We note any time we have modified the data; we aim to clearly distinguish between raw data and the datasets that we have processed/simplified.

How did the Deportation Data Project obtain these data?

We obtained these data through a Freedom of Information Act request filed by the Center for Immigration Law and Policy with our assistance. After ICE did not respond to the request for six months, CILP filed a lawsuit in the U.S. District Court for the Central District of California in December 2024 to compel the release of the data. CILP is represented in the litigation by members of our team. The first data release under CILP v ICE was produced by ICE in March 2025, and to date we have received three more updates in early June, late June, and late July.

How should I cite ICE data from the Deportation Data Project?

We recommend: “government data provided by ICE in response to a FOIA request to the Deportation Data Project and analyzed by [your organization].”

When will you update these datasets?

We promptly post data that we receive from ICE, but ICE has not agreed to release these datasets on any schedule, so it is impossible to predict when, or at what intervals, we will receive updates. We are actively seeking updates.

How can I access the data?

You can access the data in the tables below. Each table has a “Download” column with a link to download the data as an Excel file. The most recent release is also available as a ZIP file with all of the tables together. You can open them directly in Microsoft Excel or another spreadsheet program or read them into a statistical software such as R or Stata.

Do you have a data dictionary or codebook?

Yes, we compiled what we know about the ICE data in a codebook. Our understanding is very incomplete. More generally, our data guide provides an overview of US immigration enforcement data across the government.

Can I filter arrests data to my state, city, or neighborhood?

Yes, but imperfectly. Three variables (columns) may be useful: “Area of Responsibility,” “Landmark,” and “State.” Each is useful, but also incomplete. The state variable is accurate, but sometimes missing. The Area of Responsibility variable, which represents the coverage area of an ICE field office, is geographically coarse; some areas encompass very large regions. ICE provides some information on the coverage areas. The Landmark refers to a place near the arrest and is sometimes the most geographically-specific, but it is inconsistently used. You can find more location information by joining the arrests table with the detainers and detentions tables. If someone is arrested after having a detainer lodged against them, the detainers dataset includes the name of the jail or prison to which the detainer request was sent. And if someone is booked into ICE detention after arrest, the detention center that the person was first booked into offers some clues about where the person was arrested.

I saw that there is no removals dataset in your most recent release, and I remember that you warned about the reliability of previous versions of the removals data. How can I get the best picture of deportations?

We have not posted removals data from the latest update because we have doubts about its reliability. These issues are part of continuing difficulties with the removals data, which we hoped were resolved in the late July data release. (Read why we posted that dataset and believe it is complete.) In the meantime, the best ways to count removals are to identify individuals (1) in the arrests dataset who have a departed date list and (2) identify individuals in the detentions data whose final release reason is “Remove.” These methods will omit some removals that ICE conducts where an individual was never arrested or detained by ICE—for example, if that person is arrested by Border Patrol and removed by ICE without first being booked in to ICE detention. Still, these methods should offer a reasonable picture of trends over time in ICE removals.

I noticed that there are some duplicates in the arrests data. How should I account for those?

No approach should change estimates dramatically, since a small percentage of rows involve potential duplicates. There are several types of potential duplicates in the data and multiple reasonable approaches to resolving them. We flag potential duplicates—repeated rows for the same individual in the same 24-hour period—in our processed dataset. Some of these may reflect actual repeated arrests on the same day, which seems unlikely but conceivable. Three types in particular are worth mentioning. First, many of the duplicates involve a row with a case status that reads “E-Charging Document Canceled by ICE”; these seem likely to be duplicates, but it’s not always clear which row to retain, since in some cases the row with the “E-Charging Document Canceled by ICE” status includes more information (i.e. sometimes that row includes an apprehension landmark, whereas the other row for the same person does not). Second, many rows are identical across all fields apart from the time stamp; for these rows, we think it would be reasonable to choose just the later row. Third, many rows have not only identical dates and unique IDs, but also identical times stamps. It also seems safe to assume that these are duplicates. In sum, there are several ways to screen for duplicates, and correct choices are not obvious. Luckily, no choice should lead to large differences in estimates.

How can I identify courtroom arrests in the data?

Unfortunately we do not know of a good way to identify arrests at immigration courthouses in the ICE data. It may be possible to identify some courthouse arrests in the EOIR data by cross-referencing detention dates and immigration court hearing dates, but that approach may not capture all courthouse arrests.

How can I identify raids in communities in the data, as opposed to arrests at check-ins or a jail or prison?

It is not possible to fully isolate arrests that take place in communities (as opposed to within jails or prisons, for example). However, there are two indicators that may be useful: in the arrests table when “Apprehension Method” is “Located” or “Non-Custodial Arrest” we think that these records are more likely to indicate arrests in the community.

How can I tell whether detainers are honored by local and state jails and prisons?

There does not appear to be any way to determine from these data whether a jail or prison is holding individuals for up to 48 hours in response to a detainer request. However, there are ways to determine whether an individual is booked into ICE detention following a detainer request. The “Detainer Lift Reason” field in the detainers table includes values that likely represent detainer refusals (“Detainer Declined by LEA”) and some that may represent accepting detainers (“Booked into Detention”). However, that field is often missing. If the field is missing, it may mean that the person remains in criminal custody, the detainer was not honored, or that ICE has not yet updated the record to indicate whether it was honored. A second way to confirm whether the individual was transferred to ICE custody following a detainer request is to join the detainers table to the detentions table by unique ID. If the unique ID in the detainers table does not appear in the detentions table, it is possible that the detainer was not honored. Note, however, that the individual may still be in criminal custody. If the unique ID does appear in the detentions table, that means the individual was booked into ICE custody following a detainer request.

Do these data include all immigration arrests, detentions, and removals by the US government?

No, they only include actions by ICE Enforcement and Removal Operations (ERO). ICE ERO is generally responsible for civil immigration arrests in the interior of the United States, away from international borders (Austin Kocher’s Substack discusses the ICE arrests data in detail). Customs and Border Protection (CBP) conducts arrests and detentions at or near the border. Some people arrested by CBP are transferred for detention and removal by ICE. CBP also refuses entry and removes people deemed inadmissible at the border. We post data from CBP on arrests (encounters) and people deemed inadmissible at the border. CBP has not released data as recently as ICE has.

How can I identify removals to third countries?

Every table has a column for “Departed Country,” which indicates where individuals were removed to. To identify third-country removals in which a noncitizen was deported to a country other than their country of citizenship, compare those countries to the “Citizenship Country” and/or the “Birth Country” column. The “Citizenship Country” may not include all nationalities in the case of dual citizenship and, as with all data, errors are possible.

Why would data for the same individual change between releases?

ICE appears to update records retroactively in a relatively small number of cases, including by changing the arrests, encounters, detainers, and detentions tables when a removal takes place. This may result in slightly different patterns in overlapping periods of two data releases. We do not know whether there is a schedule or systematic procedure dictating when these updates occur.

Why are there many rows per person in the detention table?

Each row in the detentions table represents time in a specific detention facility from book-in to book-out. A person arrested by ICE might be transferred to multiple facilities during their detention, represented in multiple rows. Overall, ICE refers to the whole detention period (from book-in to the first detention to book-out from the last detention center) as a “stay.” A stay often includes multiple book-ins to different detention centers, and one person (identified anonymously by unique ID) can have multiple stays (if released from detention and later detained again). We have posted a simplified version of the dataset that includes just one row for each stay. See our ICE codebook and detention data processing documentation for further explanation of the raw and processed detentions data, respectively.

Why does it appear that the data is missing arrests/encounters/detainers/detentions/removals for some months?

Some of the spreadsheets are split over multiple sheets. The sheets should be stacked before being analyzed.