10/01/2024
Blog
TransUnion regularly works with researchers at universities, non-profits and government agencies to deliver de-identified credit data to enable their economic and population research. This kind of data opens significant opportunities for population sampling, understanding consumer patterns of decision-making, and evaluating financial outcomes across different moments in time. However, working with credit data and navigating the breadth and depth of information can be challenging.
Credit data is reported to TransUnion from a wide network of data furnishers. TransUnion’s database of credit histories (covering more than 260 million US adults) is supported by a network of over 85,000 national, regional and local furnishers — which includes finance companies, banks, credit unions, retailers, student loan providers, debt buyers, and more. We require all data furnishers to provide data that’s complete and accurate in accordance with the FCRA and all applicable state credit-reporting laws, and in adherence to the Metro 2® Reporting Guidelines maintained by the Credit Data Industry Association.
The atomic unit of credit data is the account, and accounts can be classified as tradelines or collections.
Consumers’ credit reports are a compilation of the different accounts associated with that consumer — as reported by these furnishers at a particular moment in time. TransUnion’s consumer reporting database also includes personally identifiable information (PII) — which is never shared for use in research.
Data furnishers provide various levels of credit account information, including financial details, descriptors and remark codes.
Most consumers are familiar with the basic fields shown on their credit reports which capture the financial details of an account at a given moment in time (usually the end of the account’s billing cycle) and includes data points like:
Data furnishers also report static properties of each account that do not typically change from month to month, such as when the account was opened or who the lender is. More important for a research audience are three key pieces of data required by the Metro 2 guidelines that provide a flexible framework for capturing the structural details of a given financial obligation:
Researchers can flexibly use these fields for a view of consumer credit obligations based on project needs. For example, the right combination of values would allow a researcher to sample from consumers who are authorized users on a private label, retailer-specific credit card issued to another consumer without a personal, bank-issued credit card of their own.
Occasionally, data furnishers will append ‘remarks’ to explain a special condition related to an account. The dates when remark codes are added or removed are also maintained in an account’s history. These remarks cover a variety of circumstances, including:
This information can aid researchers in identifying consumers who’ve had (and particular points in time when) special circumstances occurred, such as reporting a credit card lost or stolen, or the date an account was first flagged as having payment assured by wage garnishment.
In our experience, researchers use depersonalized, consumer-level data rather than account-level data. In addition to maintaining account records, TransUnion calculates consumer-level attributes, algorithms and scores derived from each consumer’s account information on a regular basis. These provide a streamlined way for researchers to operationalize credit data for model development, hypothesis testing and analytic inquiry compared to raw tradeline data.
To facilitate longitudinal economic and population research, TransUnion maintains regular snapshots of each consumer’s tradeline data and derived attributes, algorithms and scores, called “Archives.” Archives are typically created at the end of each month, allowing researchers to view how different aspects of consumers’ credit profiles change over time. These depersonalized archives can be provided for a sample of consumers where the date is the same for each individual (e.g., an entire study population as of December 31, 2022), or where the date varies based on a “benchmark date” associated with each consumer in the sample (e.g., at the end of the month, one year before X — where X is different for each consumer).
We take a consultative approach to working with researchers interested in our data. We’ve worked with experts in a variety of fields of study, including economics, public health, government policy and marketing research. Our team can provide valuable insights and best practices for experimental design, sampling from our national credit database, selecting the right attributes for impact assessments or model building, and more.