PII Reference Guide


This document outlines the types of data checks, and their methodology, for personally identifiable information (PII) when conducting an audit, including the Current Status checks. The Global defaults are always checked and cannot be disabled. Country-specific checks can be enabled optionally in your audit settings.

The Artificial Intelligence algorithms used for detecting PII are constantly evolving and this document is updated regularly.

 

 Global Defaults (always checked)


CREDIT_CARD_NUMBER 

A credit card number is 12 to 19 digits long. Credit card numbers are used for payment transactions globally.

Detection method: Pattern match and checksum
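As an illustration of "pattern match and checksum", the sketch below pairs a digit pattern with the Luhn checksum, which is the checksum commonly used for payment card numbers. The names CARD_PATTERN, luhn_valid and find_card_numbers are hypothetical and are not taken from the audit tool; the actual detector may work differently.

```python
import re

# Hypothetical pattern: 12 to 19 digits, optionally separated by spaces or hyphens.
CARD_PATTERN = re.compile(r"\b(?:\d[ -]?){11,18}\d\b")

def luhn_valid(digits):
    """Return True if the digit string passes the Luhn checksum."""
    total = 0
    for i, ch in enumerate(reversed(digits)):
        d = int(ch)
        if i % 2 == 1:          # double every second digit, counting from the right
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0

def find_card_numbers(text):
    """Return candidate card numbers that match the pattern and the checksum."""
    hits = []
    for m in CARD_PATTERN.finditer(text):
        digits = re.sub(r"[ -]", "", m.group())
        if 12 <= len(digits) <= 19 and luhn_valid(digits):
            hits.append(digits)
    return hits
```

The checksum step is what keeps arbitrary long digit strings, such as order numbers, from being reported as card numbers.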

EMAIL_ADDRESS

An email address indicates the mailbox that emails are sent to or from. The maximum length of the domain name is 255 characters, and the maximum length of the local-part is 64 characters.

Detection method: Pattern and top level domain validation
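A rough sketch of "pattern and top level domain validation" might look like the following. The regular expression is deliberately simplified, and KNOWN_TLDS is a tiny illustrative set chosen for this example; a real check would presumably use the full list of registered top-level domains. All names here are hypothetical.

```python
import re

# Assumption: a small sample of TLDs for illustration only.
KNOWN_TLDS = {"com", "org", "net", "edu", "gov", "io", "uk", "au"}

# Simplified pattern: local part of up to 64 characters, then a dotted domain.
EMAIL_PATTERN = re.compile(
    r"[A-Za-z0-9._%+-]{1,64}@((?:[A-Za-z0-9-]+\.)+([A-Za-z]{2,}))"
)

def find_emails(text):
    """Return addresses whose domain ends in a known top-level domain."""
    hits = []
    for m in EMAIL_PATTERN.finditer(text):
        domain, tld = m.group(1), m.group(2)
        if len(domain) <= 255 and tld.lower() in KNOWN_TLDS:
            hits.append(m.group())
    return hits
```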

IBAN_CODE

An International Bank Account Number (IBAN) is an internationally agreed-upon method for identifying bank accounts. It’s defined by the International Organization for Standardization (ISO) 13616:2007 standard, which originated with the European Committee for Banking Standards (ECBS). An IBAN consists of up to 34 alphanumeric characters including elements such as a country code or account number.

Detection method: Pattern match and checksum
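The IBAN checksum is the ISO 13616 mod-97 check: move the first four characters to the end, convert letters to numbers (A=10 through Z=35), and confirm the resulting integer leaves a remainder of 1 when divided by 97. The sketch below shows that calculation; the function name and the simplified pattern (which does not enforce per-country lengths) are assumptions made for this example.

```python
import re

# Simplified pattern: country code, two check digits, 11 to 30 further characters.
IBAN_PATTERN = re.compile(r"[A-Z]{2}\d{2}[A-Z0-9]{11,30}")

def iban_valid(iban):
    """Return True if the string passes the pattern and the mod-97 check."""
    s = iban.replace(" ", "").upper()
    if not IBAN_PATTERN.fullmatch(s):
        return False
    rearranged = s[4:] + s[:4]
    # int(ch, 36) maps 0-9 to 0-9 and A-Z to 10-35, as the standard requires.
    numeric = "".join(str(int(ch, 36)) for ch in rearranged)
    return int(numeric) % 97 == 1
```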

ICD9_CODE

The International Classification of Diseases, Ninth Revision, Clinical Modification (ICD-9-CM) lexicon is used to assign diagnostic and procedure codes associated with inpatient, outpatient, and physician office use in the United States. It was created by the US National Center for Health Statistics (NCHS). The ICD-9-CM is based on the ICD-9 but provides for additional morbidity detail. It’s updated annually on October 1.

Detection method: Word and phrase list

ICD10_CODE

Like ICD-9-CM codes, the International Classification of Diseases, Tenth Revision, Clinical Modification (ICD-10-CM) lexicon is a series of diagnostic codes. It is based on the ICD-10, which is published by the World Health Organization (WHO) to describe causes of morbidity and mortality.

Detection method: Word and phrase list

IMEI_HARDWARE_ID

An International Mobile Equipment Identity (IMEI) hardware identifier, used to identify mobile phones.

Detection method: Custom logic, pattern match and context

IP_ADDRESS

An Internet Protocol (IP) address (either IPv4 or IPv6).

Detection method: Custom logic, pattern match and context
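For the pattern part of IP address detection, one simple approach is to let Python's standard ipaddress module decide whether a token is a valid IPv4 or IPv6 address. This is only a sketch of that one step; classify_ip is a hypothetical helper and the custom logic and context checks mentioned above are not shown.

```python
import ipaddress

def classify_ip(token):
    """Return "IPv4" or "IPv6" for a valid address, otherwise None."""
    try:
        return f"IPv{ipaddress.ip_address(token).version}"
    except ValueError:
        return None
```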

MAC_ADDRESS, MAC_ADDRESS_LOCAL

A media access control (MAC) address is an identifier for a network adapter.

Detection method: Custom logic, pattern match and context

Context:

  • mac address
  • hardware address
  • physical address
  • hwaddr
  • ether
  • ethernet
  • BSSID
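The sketch below shows one way a pattern match can be combined with the context words listed above: each candidate is reported together with a flag saying whether a context word appears nearby. The 50-character window, the function name and the way the flag is used are all assumptions for illustration, not a description of the actual detector.

```python
import re

# Six pairs of hex digits separated by colons or hyphens.
MAC_PATTERN = re.compile(r"\b[0-9A-Fa-f]{2}(?:[:-][0-9A-Fa-f]{2}){5}\b")

# Context words from the list above, lowercased for comparison.
CONTEXT_WORDS = ["mac address", "hardware address", "physical address",
                 "hwaddr", "ether", "ethernet", "bssid"]

WINDOW = 50  # assumption: how many characters around a match count as "nearby"

def find_macs(text):
    """Return (match, has_context) pairs for every MAC-shaped string."""
    results = []
    lowered = text.lower()
    for m in MAC_PATTERN.finditer(text):
        nearby = lowered[max(0, m.start() - WINDOW): m.end() + WINDOW]
        has_context = any(word in nearby for word in CONTEXT_WORDS)
        results.append((m.group(), has_context))
    return results
```

Note that a plain substring test also matches context words inside longer words (for example "ether" inside "whether"), so a production check would likely need word-boundary handling.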

PHONE_NUMBER

A telephone number or US toll-free telephone number.

Detection method: Custom logic, pattern match and context

SWIFT_CODE

A SWIFT code is the same as a Bank Identifier Code (BIC). It’s a unique identification code for a particular bank. These codes are used when transferring money between banks, particularly for international wire transfers. Banks also use the codes for exchanging other messages.

Detection method: Pattern match and context

Context:

  • SWIFT
  • ISO 9362
  • Business Identifier Code
  • BIC
  • Business Entity Identifier
  • BEI
  • bank
  • interbank

 Australia Specific (optional)


AUSTRALIA_MEDICARE_NUMBER 

A 9-digit Medicare account number is issued to permanent residents of Australia (except for Norfolk Island). The primary purpose of this number is to prove Medicare eligibility to receive subsidized care in Australia.

Detection method: Checksum and (pattern match or context)

Context:

  • Medicare
  • Australia
  • IRN

AUSTRALIA_TAX_FILE_NUMBER

An Australian tax file number (TFN) is a number issued by the Australian Taxation Office for taxpayer identification. Every taxpaying entity, such as an individual or an organization, is assigned a unique number.

Detection method: Checksum and (pattern match or context)

Context:

  • Tax File Number
  • TFN
  • Australian Tax Office
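The rule "Checksum and (pattern match or context)" can be read as a boolean expression: the checksum must always pass, and in addition either the formatting or a nearby context word must support the match. The sketch below applies that reading to 9-digit TFNs using the commonly documented weighted mod-11 check. The spaced pattern, the helper names and the handling of context are assumptions for this example; 8-digit TFNs use different weights and are not covered.

```python
import re

TFN_WEIGHTS = (1, 4, 3, 7, 5, 8, 6, 9, 10)

# Assumption: "pattern match" means the familiar 3-3-3 spaced layout.
TFN_PATTERN = re.compile(r"\d{3} \d{3} \d{3}")

# Context words from the list above, lowercased for comparison.
TFN_CONTEXT = ("tax file number", "tfn", "australian tax office")

def tfn_checksum_ok(digits):
    """Weighted sum of the nine digits must be divisible by 11."""
    if len(digits) != 9:
        return False
    return sum(int(d) * w for d, w in zip(digits, TFN_WEIGHTS)) % 11 == 0

def looks_like_tfn(candidate, surrounding_text):
    """Checksum AND (pattern match OR context)."""
    digits = re.sub(r"\D", "", candidate)
    pattern_hit = TFN_PATTERN.fullmatch(candidate.strip()) is not None
    context_hit = any(w in surrounding_text.lower() for w in TFN_CONTEXT)
    return tfn_checksum_ok(digits) and (pattern_hit or context_hit)
```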

 Brazil Specific (optional)


BRAZIL_CPF_NUMBER 

The Cadastro de Pessoas Físicas (CPF) number, or Natural Persons Register number, is an 11-digit number used in Brazil for taxpayer identification.

Detection method: Checksum and (pattern match or context)

Context:

  • CPF
  • Cadastro de Pessoas Físicas
  • Pessoas Físicas
  • Tax Number
  • Taxpayer
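For reference, the CPF checksum uses two check digits, each computed from the preceding digits with descending weights and a mod-11 step. The sketch below follows the commonly documented rule; cpf_valid is a hypothetical name, and rejecting numbers made of one repeated digit is a widely used convention rather than something stated in this guide.

```python
def cpf_valid(cpf):
    """Return True if the 11-digit CPF has two correct check digits."""
    digits = [int(c) for c in cpf if c.isdecimal()]
    # Repeated-digit numbers such as 111.111.111-11 pass the arithmetic
    # but are conventionally treated as invalid.
    if len(digits) != 11 or len(set(digits)) == 1:
        return False
    for n in (9, 10):
        # Weights run from n + 1 down to 2 over the first n digits.
        total = sum(d * w for d, w in zip(digits[:n], range(n + 1, 1, -1)))
        check = (total * 10) % 11 % 10   # a result of 10 becomes 0
        if check != digits[n]:
            return False
    return True
```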

 Canada Specific (optional)


CANADA_BC_PHN

The British Columbia Personal Health Number (PHN) is issued to citizens, permanent residents, temporary workers, students, and other individuals who are entitled to health care coverage in the Province of British Columbia.

Detection method: Pattern match or 10 digits with context

Context:

  • BC ID
  • PHN
  • British Columbia
  • Personal Health Number
  • Services Card
  • Canadian health insurance number
  • Canadian health ID

CANADA_OHIP

The Ontario Health Insurance Plan (OHIP) number is issued to citizens, permanent residents, temporary workers, students, and other individuals who are entitled to health care coverage in the Province of Ontario.

Detection method: Pattern match and checksum

CANADA_PASSPORT

Canadian passport number.

Detection method: Pattern match and context

Context:

  • Canada
  • Canadian
  • Numéro de passeport
  • Passport
  • Travel Document
  • document number

CANADA_QUEBEC_HIN

The Quebec Health Insurance Number (HIN) is issued to citizens, permanent residents, temporary workers, students and other individuals who are entitled to health care coverage in the Province of Quebec.

Detection method: Pattern match

CANADA_SOCIAL_INSURANCE_NUMBER 

The Canadian Social Insurance Number (SIN) is the main identifier used in Canada for citizens, permanent residents, and those on work or study visas. With a Canadian SIN and mailing address, one can apply for health care coverage, driver’s licenses, and other important services.

Detection method: Checksum and (pattern match or context)

 China Specific (optional)


CHINA_PASSPORT 

Chinese passport number.

Detection method: Pattern match and context

Context:

  • China
  • Passport
  • 中华人民共和国护照
  • 护照号
  • Hùzhào hào
  • 护照

 France Specific (optional)

FRANCE_CNI

The Carte Nationale d’Identité Sécurisée (CNI or CNIS) is the French national identity card. It’s an official identity document consisting of a 12-digit identification number. This number is commonly used when opening bank accounts and when paying by check. It can sometimes be used instead of a passport or visa within the European Union (EU) and in some other countries.

Detection method: Pattern match and context

Context:

  • CNI
  • CNIS (carte nationale d’identité securisée)
  • identité
  • identite

FRANCE_NIR

The Numéro d’Inscription au Répertoire (NIR) is a permanent personal identification number that’s also known as the French social security number for services including healthcare as well as pensions.

Detection method: Pattern match and checksum

FRANCE_PASSPORT 

French passport number.

Detection method: Pattern match and context

Context:

  • France
  • Passport
  • Passeport
  • REPUBLIC FRANCAIS
  • Numéro de passeport

 Germany Specific (optional)

GERMANY_PASSPORT 

German passport number. The format of a German passport number is 10 alphanumeric characters, chosen from numerals 0-9 and letters C, F, G, H, J, K, L, M, N, P, R, T, V, W, X, Y, Z.

Detection method: Pattern match and context

Context:

  • GERMANY
  • REISEPASS
  • PASSPORT
  • Europäische Union
  • Bundesrepublik
  • Deutschland
  • reisepassnummer
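The character set described above translates directly into a regular expression. The sketch below is one possible rendering of it; the name GERMAN_PASSPORT is hypothetical. Because any 10-digit number also fits this pattern, the example illustrates why the pattern is paired with context words rather than used alone.

```python
import re

# 10 characters drawn from digits 0-9 and the letters listed in the description.
GERMAN_PASSPORT = re.compile(r"\b[0-9CFGHJKLMNPRTVWXYZ]{10}\b")

def find_passport_candidates(text):
    """Return every token that fits the 10-character pattern."""
    return GERMAN_PASSPORT.findall(text)
```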

 India Specific (optional)

INDIA_PAN_INDIVIDUAL 

The Personal Permanent Account Number (PAN) is a unique 10-character alphanumeric identifier used for identification of individuals, particularly those who pay income tax. It’s issued by the Indian Income Tax Department. The PAN is valid for the lifetime of the holder.

Detection method: Pattern match and context

Context:

  • India
  • Account Number
  • PAN
  • Taxpayer ID

 Japan Specific (optional)

JAPAN_INDIVIDUAL_NUMBER 

Sometimes referred to as “My Number,” the Japanese national identification number is a national ID number that has been in use since January 2016.

Context:

  • 個人番号
  • マイナンバー
  • 身分証明書
  • Individual Number
  • My Number
  • Identity Card

JAPAN_PASSPORT

Japanese passport number. The passport number consists of two alphabetic characters followed by seven digits.

Detection method: Pattern match and context

Context:

  • パスポート
  • パスポート番号
  • Japan
  • Passport

 Korea Specific (optional)

KOREA_PASSPORT

Korean passport number. There are two different formats:

  • Pre-2008 passport numbers consist of 9 characters. The first two characters are the issued local code, corresponding to the holder’s gu, or district. The remaining seven digits are the serial number.
  • Post-2008 passport numbers consist of 9 characters. The first character is either a single letter M, denoting PM passports, or the letter S for PS passports. The remaining 8 digits are the serial number.

Detection method: Pattern match and context

Context:

  • 여권
  • 대한민국
  • Passport
  • Korea

KOREA_RRN

South Korean Resident Registration Number (RRN), sometimes referred to as the South Korean Social Security Number.

Detection method: Pattern match, checksum and context

Context:

  • 주민등록번호
  • 住民登錄番號
  • korean
  • korea
  • KSSN
  • RRN
  • resident registration
  • registration number
  • social security

 Mexico Specific (optional)

MEXICO_CURP_NUMBER 

The Mexico Clave Única de Registro de Población (CURP) number, also called the Unique Population Registry Code or Personal Identification Code number, is an 18-character state-issued identification number assigned by the Mexican government to citizens or residents of Mexico and used for taxpayer identification.

Detection method: Pattern match and context

Context:

  • CURP
  • Clave Única
  • Población
  • Registro
  • UPRC
  • Personal ID
  • Registry Code

MEXICO_PASSPORT

Mexican passport number.

Detection method: Pattern match and context

Context:

  • Mexico
  • Passport
  • Pasaporte
  • México
  • Mexican

 Netherlands Specific (optional)

NETHERLANDS_BSN_NUMBER 

The Netherlands Burgerservicenummer (BSN), or Citizen’s Service Number, is a state-issued identification number that’s on driver’s licenses, passports, and international ID cards.

Detection method: Checksum and (pattern match or context)

Context:

  • BSN
  • Personal Number
  • Burgerservicenummer
  • Netherlands
  • Identification Number
  • Service Number
  • sofinummer
  • sofi
  • personalnummer
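The BSN checksum is commonly described as the "11-test": the nine digits are multiplied by the weights 9, 8, 7, 6, 5, 4, 3, 2 and -1, and the sum must be divisible by 11. The sketch below shows only that checksum step for 9-digit values; bsn_valid is a hypothetical name, and the pattern and context parts of the rule are not shown.

```python
BSN_WEIGHTS = (9, 8, 7, 6, 5, 4, 3, 2, -1)

def bsn_valid(bsn):
    """Return True if the 9-digit string passes the 11-test."""
    if len(bsn) != 9 or not bsn.isdecimal():
        return False
    total = sum(int(d) * w for d, w in zip(bsn, BSN_WEIGHTS))
    return total % 11 == 0
```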

 Spain Specific (optional)

SPAIN_NIE_NUMBER 

The Número de Identificación de Extranjeros (NIE) is an identification number for foreigners living or doing business in Spain. An NIE number is needed for key transactions such as opening a bank account, buying a car, or setting up a mobile phone contract.

Detection method: Checksum and (pattern match or context)

Context:

  • Número de Identificación de Extranjeros
  • NIE

SPAIN_NIF_NUMBER

The Número de Identificación Fiscal (NIF) is a government identification number for Spanish citizens. An NIF number is needed for key transactions such as opening a bank account, buying a car, or setting up a mobile phone contract.

Detection method: Checksum and (pattern match or context)

Context:

  • Número de Identificación Fiscal
  • NIF
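Both the NIE and the NIF for individuals end in a control letter derived from the numeric part modulo 23. For an NIE, the leading X, Y or Z is first replaced by 0, 1 or 2. The sketch below shows that commonly documented rule; the function names are hypothetical, and NIF formats used for legal entities are not covered.

```python
import re

# Control letters indexed by (number % 23).
CONTROL_LETTERS = "TRWAGMYFPDXBNJZSQVHLCKE"

def nif_valid(nif):
    """Eight digits followed by the matching control letter."""
    m = re.fullmatch(r"(\d{8})([A-Z])", nif.upper())
    return bool(m) and CONTROL_LETTERS[int(m.group(1)) % 23] == m.group(2)

def nie_valid(nie):
    """X, Y or Z, seven digits, then the matching control letter."""
    m = re.fullmatch(r"([XYZ])(\d{7})([A-Z])", nie.upper())
    if not m:
        return False
    # Replace the leading letter with 0, 1 or 2 before taking the remainder.
    number = int(str("XYZ".index(m.group(1))) + m.group(2))
    return CONTROL_LETTERS[number % 23] == m.group(3)
```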

SPAIN_PASSPORT

Spanish Ordinary Passport (Pasaporte Ordinario) number. There are 4 different types of passports in Spain. This detector is for the Ordinary Passport (Pasaporte Ordinario) type, which is issued for ordinary travel, such as vacations and business trips.

Detection method: Pattern match and context

Context:

  • Passport
  • Pasaporte
  • Espana
  • España
  • Spain

 United Kingdom Specific (optional)

UK_DRIVERS_LICENSE_NUMBER

A driver’s license number for the United Kingdom of Great Britain and Northern Ireland (UK).

Detection method: Pattern match

UK_NATIONAL_HEALTH_SERVICE_NUMBER 

A National Health Service (NHS) number is the unique number allocated to a registered user of the three public health services in England, Wales, and the Isle of Man.

Detection method: Pattern match and checksum
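The NHS number checksum is a weighted mod-11 check: the first nine digits are multiplied by weights 10 down to 2, and the tenth digit must equal 11 minus the remainder of that sum divided by 11 (with 11 treated as 0, and 10 meaning the number is invalid). The sketch below shows that calculation; nhs_valid is a hypothetical name.

```python
def nhs_valid(number):
    """Return True if the 10-digit NHS number has a correct check digit."""
    digits = [int(c) for c in number if c.isdecimal()]
    if len(digits) != 10:
        return False
    total = sum(d * w for d, w in zip(digits[:9], range(10, 1, -1)))
    check = 11 - total % 11
    if check == 11:
        check = 0
    # A computed check value of 10 never corresponds to a valid number.
    return check != 10 and check == digits[9]
```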

UK_NATIONAL_INSURANCE_NUMBER

The National Insurance number (NINO) is a number used in the United Kingdom (UK) in the administration of the National Insurance or social security system. It identifies people, and is also used for some purposes in the UK tax system. The number is sometimes referred to as NI No or NINO.

Detection method: Pattern match (with delimiters) or pattern match and context words

UK_PASSPORT

United Kingdom (UK) passport number.

Detection method: Pattern match and context

Context:

  • United Kingdom
  • Passport
  • Travel Document

UK_TAXPAYER_REFERENCE

United Kingdom (UK) Unique Taxpayer Reference (UTR) number. This number, a string of 10 decimal digits, is an identifier used by the UK government to manage the taxation system. Unlike other identifiers, such as the passport number or social insurance number, the UTR is not listed on official identity cards.

Detection method: Pattern match and context

Context:

  • United Kingdom
  • Taxpayer
  • UTR

 United States Specific (optional)

AMERICAN_BANKERS_CUSIP_ID

A Committee on Uniform Securities Identification Procedures (CUSIP) number is a 9-character alphanumeric code that identifies a North American financial security.

Detection method: Checksum or context (when check digit not present)

Context: CUSIP

US_ADOPTION_TAXPAYER_IDENTIFICATION_NUMBER

An Adoption Taxpayer Identification Number (ATIN) is a type of Tax Identification Number (TIN), issued by the Internal Revenue Service (IRS) to individuals who are in the process of legally adopting a US citizen or resident child.

Detection method: Pattern match or 9 digits with context

Context:

  • SSN
  • Social
  • Social Security
  • Taxpayer
  • Taxpayer ID
  • Taxpayer identification
  • Tax ID
  • Tax identification
  • ATIN
  • TIN
  • Pending TIN
  • Adoption
  • Adoptions
  • Pending US adoption
  • Pending US adoptions

US_BANK_ROUTING_MICR

The American Bankers Association (ABA) Routing Number (also called the transit number) is a nine-digit code. It’s used to identify the financial institution that’s responsible to credit or entitled to receive credit for a check or electronic transaction.

Detection method: Checksum on 9 digits

Context:

  • ABA
  • routing
  • transit
  • bank
  • banking
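The "checksum on 9 digits" for ABA routing numbers is commonly described as a weighted sum with repeating weights 3, 7 and 1 that must be divisible by 10. The sketch below shows that calculation only; aba_valid is a hypothetical name and the use of context words is not shown.

```python
def aba_valid(routing):
    """Return True if the 9-digit routing number passes the 3-7-1 checksum."""
    if len(routing) != 9 or not routing.isdecimal():
        return False
    d = [int(c) for c in routing]
    total = (3 * (d[0] + d[3] + d[6])
             + 7 * (d[1] + d[4] + d[7])
             + (d[2] + d[5] + d[8]))
    return total % 10 == 0
```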

US_DEA_NUMBER

A Drug Enforcement Administration (DEA) number is assigned to a health care provider by the US DEA. It allows the health care provider to write prescriptions for controlled substances. The DEA number is often used as a general “prescriber number” that is a unique identifier for anyone who can prescribe medication.

Detection method: Pattern match and checksum
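A DEA number is commonly described as two leading characters followed by seven digits, where the seventh digit is a check digit: add the first, third and fifth digits to twice the sum of the second, fourth and sixth, and the last digit of the result must equal the seventh digit. The sketch below follows that description; the exact pattern for the two leading characters is an assumption, and dea_valid is a hypothetical name.

```python
import re

def dea_valid(dea):
    """Return True if the DEA number matches the pattern and check digit."""
    # Assumption: a letter, then a letter or 9, then seven digits.
    m = re.fullmatch(r"[A-Z][A-Z9](\d{7})", dea.upper())
    if not m:
        return False
    d = [int(c) for c in m.group(1)]
    total = (d[0] + d[2] + d[4]) + 2 * (d[1] + d[3] + d[5])
    return total % 10 == d[6]
```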

US_EMPLOYER_IDENTIFICATION_NUMBER

An Employer Identification Number (EIN) is also known as a Federal Tax Identification Number, and is used to identify a business entity.

Detection method: Pattern match or 9 digits with context

Context:

  • employer
  • patronal
  • ein

US_HEALTHCARE_NPI

The National Provider Identifier (NPI) is a unique 10-digit identification number issued to health care providers in the United States by the Centers for Medicare and Medicaid Services (CMS). The NPI has replaced the unique provider identification number (UPIN) as the required identifier for Medicare services. It’s also used by other payers, including commercial healthcare insurers.

Detection method: Checksum on 10 digits
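The NPI check digit is a Luhn checksum calculated over the 10-digit NPI with the constant prefix 80840 placed in front of it. The sketch below shows that calculation; the function names are hypothetical.

```python
def luhn_valid(digits):
    """Return True if the digit string passes the Luhn checksum."""
    total = 0
    for i, ch in enumerate(reversed(digits)):
        d = int(ch)
        if i % 2 == 1:          # double every second digit, counting from the right
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0

def npi_valid(npi):
    """Apply the Luhn check to the NPI with the 80840 prefix prepended."""
    return len(npi) == 10 and npi.isdecimal() and luhn_valid("80840" + npi)
```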

US_INDIVIDUAL_TAXPAYER_IDENTIFICATION_NUMBER 

An Individual Taxpayer Identification Number (ITIN) is a type of Tax Identification Number (TIN), issued by the Internal Revenue Service (IRS). An ITIN is a tax processing number only available for certain nonresident and resident aliens, their spouses, and dependents who cannot get a Social Security Number (SSN).

Detection method: Pattern match or 9 digits with context

Context:

  • SSN
  • Social
  • Social Security
  • Taxpayer
  • Taxpayer ID
  • Tax ID
  • Tax identification
  • ITIN
  • TIN
  • Individual Tax Identification Number

US_PASSPORT

United States passport number.

Detection method: Pattern match and context

Context:

  • United States
  • USA
  • Passport
  • Travel
  • Document

US_PREPARER_TAXPAYER_IDENTIFICATION_NUMBER

A Preparer Taxpayer Identification Number (PTIN) is an identification number that all paid tax return preparers must use on US federal tax returns or claims for refund submitted to the Internal Revenue Service (IRS).

Detection method: Pattern match and context

Context:

  • SSN
  • Social
  • Social Security
  • Taxpayer
  • Taxpayer ID
  • Taxpayer identification
  • Tax ID
  • Tax identification
  • PTIN
  • TIN
  • Preparer Taxpayer Identification Number

US_SOCIAL_SECURITY_NUMBER

A United States Social Security number (SSN) is a 9-digit number issued to US citizens, permanent residents, and temporary residents. The Social Security number has effectively become the United States national identification number.

Detection method: Pattern match or 9 digits with context

Context:

  • SSN
  • Social
  • Social Security
  • Taxpayer
  • Taxpayer ID
  • Taxpayer identification

US_VEHICLE_IDENTIFICATION_NUMBER

A vehicle identification number (VIN) is a unique 17-character code assigned to every on-road motor vehicle.

Detection method: Checksum and pattern match

Context:

  • VIN
  • Vehicle Identification Number
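The VIN checksum used in North America treats the ninth character as a check digit. Each character is converted to a number, multiplied by a position weight, and the sum modulo 11 gives the check digit, with 10 written as X. The sketch below follows that commonly documented scheme; vin_valid and the table names are hypothetical.

```python
import re

# Letter-to-number transliteration (I, O and Q are not used in VINs).
TRANSLIT = dict(zip("ABCDEFGH", range(1, 9)))
TRANSLIT.update(zip("JKLMN", range(1, 6)))
TRANSLIT.update({"P": 7, "R": 9})
TRANSLIT.update(zip("STUVWXYZ", range(2, 10)))

# Position weights; the check digit itself (position 9) has weight 0.
WEIGHTS = (8, 7, 6, 5, 4, 3, 2, 10, 0, 9, 8, 7, 6, 5, 4, 3, 2)

def vin_valid(vin):
    """Return True if the 17-character VIN has a correct check digit."""
    vin = vin.upper()
    if not re.fullmatch(r"[A-HJ-NPR-Z0-9]{17}", vin):
        return False
    total = sum((int(c) if c.isdigit() else TRANSLIT[c]) * w
                for c, w in zip(vin, WEIGHTS))
    remainder = total % 11
    expected = "X" if remainder == 10 else str(remainder)
    return vin[8] == expected
```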

See also: What Counts as Personal Information

