PII Audit Reference Guide

Audit Method for Detecting Personal Information

Even if you delete personally identifiable information (PII) from your Google Analytics reports, the damage has already been done. Sensitive information passed in URLs and cookies is also likely to be collected by other trackers on your pages and by the servers and routers it passes through in transit. It is better to audit for PII regularly and root it out. This reference guide describes our methodology.

PII Audit Method & Dimensions

We integrate with AI technology from Google to identify suspected personal data. All processing is done within the EU. When PII is suspected and flagged in an audit, the matched item is hashed during processing and labelled as “scrubbed”. That is, Verified Data does NOT store any personal data found – the issue remains isolated for you to investigate and fix.
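
The hash-and-label step can be pictured with a minimal sketch. The hash algorithm (SHA-256), the function name `scrub_match` and the record fields are assumptions made for illustration; this guide does not specify which algorithm or storage format is used.

```python
import hashlib


def scrub_match(matched_text: str, info_type: str) -> dict:
    """Return a record holding a one-way digest of the match, never the match itself.

    SHA-256 and the field names are illustrative assumptions.
    """
    digest = hashlib.sha256(matched_text.encode("utf-8")).hexdigest()
    return {"info_type": info_type, "value_hash": digest, "status": "scrubbed"}


record = scrub_match("jane.doe@example.com", "EMAIL_ADDRESS")
```

The resulting record identifies the kind of PII found and where to look, without retaining the value.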

The DATA Inspector audits the following Google Analytics dimensions for suspected PII:

    • Page URLs*
    • Referral Page URLs*
    • Event Parameters
    • Campaign Parameters
    • Custom Dimensions
    • E-commerce Affiliation

*Google Analytics collects page URLs by default – and in vast quantities. To give the best chance of detection, we pre-filter URLs for any occurrence of “@”. That is, the haystack is dramatically reduced by assuming that PII within a URL is highly likely to include at least an email address. Note: the other dimensions are not pre-filtered.
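
As a rough sketch of the pre-filter idea, the snippet below keeps only URLs that contain “@”. The function name `prefilter_urls` is hypothetical, and decoding percent-encoded characters (so that `%40` also counts) is an assumption of this sketch rather than a documented behaviour.

```python
from urllib.parse import unquote


def prefilter_urls(urls):
    """Keep only URLs that contain "@" once percent-encoding is decoded."""
    return [url for url in urls if "@" in unquote(url)]


sample = [
    "/checkout?step=2",
    "/thanks?email=jane.doe@example.com",
    "/signup?user=jane.doe%40example.com",
]
candidates = prefilter_urls(sample)  # only the last two URLs remain
```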

List of Specific PII Checks

The Global Default checks are always run and cannot be disabled. You can also check country-specific fields such as tax IDs and social security numbers from our list of 39 countries:

Argentina, Australia, Belgium, Brazil, Canada, Chile, China, Colombia, Denmark, France, Finland, Germany, Hong Kong, India, Indonesia, Ireland, Israel, Italy, Japan, Korea, Mexico, Netherlands, New Zealand, Norway, Paraguay, Peru, Poland, Portugal, Singapore, South Africa, Spain, Sweden, Taiwan, Thailand, Turkey, United Kingdom, United States, Uruguay, Venezuela.

 

Global Defaults (always checked)


ADVERTISING_ID Identifiers used by developers to track users for advertising purposes. These include Google Play Advertising IDs, Amazon Advertising IDs, Apple’s identifierForAdvertising (IDFA), and Apple’s identifierForVendor (IDFV).
CREDIT_CARD_NUMBER A credit card number is 12 to 19 digits long. They are used for payment transactions globally.

Detection method: Pattern match and checksum
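
A pattern match followed by a checksum can be sketched as below. Payment card numbers use the publicly documented Luhn check; the regular expression, the accepted separators and the function names are assumptions for illustration and are not the detector’s actual rules.

```python
import re

# Assumed pattern: 12 to 19 digits, optionally separated by single spaces or hyphens.
CARD_PATTERN = re.compile(r"\b\d(?:[ -]?\d){11,18}\b")


def luhn_valid(digits: str) -> bool:
    """Luhn check: double every second digit from the right, sum, test mod 10."""
    total = 0
    for index, char in enumerate(reversed(digits)):
        value = int(char)
        if index % 2 == 1:
            value *= 2
            if value > 9:
                value -= 9
        total += value
    return total % 10 == 0


def find_card_numbers(text: str) -> list:
    """Return digit strings that match the pattern and pass the checksum."""
    hits = []
    for match in CARD_PATTERN.finditer(text):
        digits = re.sub(r"[ -]", "", match.group())
        if luhn_valid(digits):
            hits.append(digits)
    return hits
```

The checksum step is what keeps ordinary long numbers, such as order IDs, from being flagged on pattern alone.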

EMAIL_ADDRESS An email address indicates the mailbox that emails are sent to or from. The maximum length of the domain name is 255 characters, and the maximum length of the local-part is 64 characters.

Detection method: Pattern and top level domain validation

IBAN_CODE An International Bank Account Number (IBAN) is an internationally agreed-upon method for identifying bank accounts. It’s defined by the International Organization for Standardization (ISO) 13616:2007 standard. ISO 13616:2007 was created by the European Committee for Banking Standards (ECBS). An IBAN consists of up to 34 alphanumeric characters including elements such as a country code or account number.

Detection method: Pattern match and checksum
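
For IBANs, the publicly documented check is the mod-97 test. The sketch below is illustrative only: the pattern is a simplified assumption (it does not enforce per-country lengths) and `iban_valid` is a hypothetical name.

```python
import re

# Assumed simplified pattern: country code, two check digits, 11 to 30 further characters.
IBAN_PATTERN = re.compile(r"^[A-Z]{2}\d{2}[A-Z0-9]{11,30}$")


def iban_valid(candidate: str) -> bool:
    """Pattern match, then the mod-97 checksum."""
    iban = candidate.replace(" ", "").upper()
    if not IBAN_PATTERN.match(iban):
        return False
    # Move the first four characters to the end, then map letters to numbers (A=10 ... Z=35).
    rearranged = iban[4:] + iban[:4]
    numeric = "".join(str(int(char, 36)) for char in rearranged)
    return int(numeric) % 97 == 1
```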

IMEI_HARDWARE_ID An International Mobile Equipment Identity (IMEI) hardware identifier, used to identify mobile phones.

Detection method: Custom logic, pattern match and context

IP_ADDRESS An Internet Protocol (IP) address (either IPv4 or IPv6).

Detection method: Custom logic, pattern match and context

MAC_ADDRESS, MAC_ADDRESS_LOCAL A media access control address (MAC address), which is an identifier for a network adapter.

Detection method: Custom logic, pattern match and context

Context:

  • mac address
  • hardware address
  • physical address
  • hwaddr
  • ether
  • ethernet
  • BSSID
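
One way to picture “pattern match and context” is a regular expression combined with a search for the context words near the match. In this sketch the pattern, the 50-character window and the choice to require a context word (rather than merely raise confidence) are all assumptions; the function name is hypothetical.

```python
import re

# Assumed pattern: six hex pairs separated by ":" or "-".
MAC_PATTERN = re.compile(r"\b[0-9A-Fa-f]{2}(?:[:-][0-9A-Fa-f]{2}){5}\b")
HOTWORDS = ("mac address", "hardware address", "physical address",
            "hwaddr", "ether", "ethernet", "bssid")
WINDOW = 50  # assumed number of characters searched on each side of a match


def find_mac_addresses(text: str) -> list:
    """Return pattern matches that have a context word nearby."""
    lowered = text.lower()
    hits = []
    for match in MAC_PATTERN.finditer(text):
        start = max(0, match.start() - WINDOW)
        nearby = lowered[start:match.end() + WINDOW]
        if any(word in nearby for word in HOTWORDS):
            hits.append(match.group())
    return hits
```
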
MEDICAL_TERM Terms that commonly refer to a person’s medical condition or health.
PASSPORT A passport number that matches passport numbers for the following countries: Australia, Canada, China, France, Germany, Japan, Korea, Mexico, The Netherlands, Poland, Singapore, Spain, Sweden, Taiwan, United Kingdom, and the United States.
PHONE_NUMBER A telephone number or US toll-free telephone number.

Detection method: Custom logic, pattern match and context

SWIFT_CODE A SWIFT code is the same as a Bank Identifier Code (BIC). It’s a unique identification code for a particular bank. These codes are used when transferring money between banks, particularly for international wire transfers. Banks also use the codes for exchanging other messages.

Detection method: Pattern match and context

Context:

  • SWIFT
  • ISO 9362
  • Business Identifier Code
  • BIC
  • Business Entity Identifier
  • BEI
  • bank
  • interbank

Argentina Specific (optional)


ARGENTINA_DNI_NUMBER An Argentine Documento Nacional de Identidad (DNI), or national identity card, is used as the main identity document for citizens.

Australia Specific (optional)


AUSTRALIA_MEDICARE_NUMBER A 9-digit Medicare account number is issued to permanent residents of Australia (except for Norfolk island). The primary purpose of this number is to prove Medicare eligibility to receive subsidized care in Australia.

Detection method: Checksum and (pattern match or context)

Context:

  • Medicare
  • Australia
  • IRN
AUSTRALIA_TAX_FILE_NUMBER An Australian tax file number (TFN) is a number issued by the Australian Tax Office for taxpayer identification. Every taxpaying entity, such as an individual or an organization, is assigned a unique number.

Detection method: Checksum and (pattern match or context)

Context:

  • Tax File Number
  • TFN
  • Australian Tax Office
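
The checksum part can be illustrated with the publicly documented weighting scheme for 9-digit TFNs. This is a sketch under that assumption, with a hypothetical function name; it does not cover older 8-digit numbers and is not the detector’s actual code.

```python
import re

TFN_WEIGHTS = (1, 4, 3, 7, 5, 8, 6, 9, 10)


def tfn_checksum_valid(candidate: str) -> bool:
    """Weighted digit sum must be divisible by 11."""
    cleaned = re.sub(r"[ -]", "", candidate)
    if not re.fullmatch(r"[0-9]{9}", cleaned):
        return False
    total = sum(int(char) * weight for char, weight in zip(cleaned, TFN_WEIGHTS))
    return total % 11 == 0
```
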

Belgium Specific (optional)


BELGIUM_NATIONAL_ID_CARD_NUMBER A 12-digit Belgian national identity card number.

Brazil Specific (optional)


BRAZIL_CPF_NUMBER The Cadastro de Pessoas Físicas (CPF) number, or Natural Persons Register number, is an 11-digit number used in Brazil for taxpayer identification.

Detection method: Checksum and (pattern match or context)

Context:

  • CPF
  • Cadastro de Pessoas Físicas
  • Pessoas Físicas
  • Tax Number
  • Taxpayer
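
The CPF checksum is publicly documented: the last two digits are check digits computed from the preceding ones. The sketch below assumes that scheme; the function name is hypothetical.

```python
import re


def cpf_checksum_valid(candidate: str) -> bool:
    """Verify both CPF check digits."""
    cleaned = re.sub(r"[ .-]", "", candidate)
    if not re.fullmatch(r"[0-9]{11}", cleaned):
        return False
    digits = [int(char) for char in cleaned]

    def check_digit(prefix):
        # Weights count down to 2: 10..2 for the first check digit, 11..2 for the second.
        weights = range(len(prefix) + 1, 1, -1)
        total = sum(d * w for d, w in zip(prefix, weights))
        return (total * 10) % 11 % 10

    return (check_digit(digits[:9]) == digits[9]
            and check_digit(digits[:10]) == digits[10])
```

Strings of one repeated digit, such as 000.000.000-00, pass this arithmetic and are not excluded by the sketch.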

Canada Specific (optional)


CANADA_BC_PHN The British Columbia Personal Health Number (PHN) is issued to citizens, permanent residents, temporary workers, students, and other individuals who are entitled to health care coverage in the Province of British Columbia.

Detection method: Pattern match or 10 digits with context

Context:

  • BC ID
  • PHN
  • British Columbia
  • Personal Health Number
  • Services Card
  • Canadian health insurance number
  • Canadian health ID
CANADA_OHIP The Ontario Health Insurance Plan (OHIP) number is issued to citizens, permanent residents, temporary workers, students, and other individuals who are entitled to health care coverage in the Province of Ontario.

Detection method: Pattern match and checksum

CANADA_PASSPORT Canadian passport number.

Detection method: Pattern match and context

Context:

  • Canada
  • Canadian
  • Numéro de passeport
  • Passport
  • Travel Document
  • document number
CANADA_QUEBEC_HIN The Quebec Health Insurance Number (HIN) is issued to citizens, permanent residents, temporary workers, students and other individuals who are entitled to health care coverage in the Province of Quebec.

Detection method: Pattern match

CANADA_SOCIAL_INSURANCE_NUMBER The Canadian Social Insurance Number (SIN) is the main identifier used in Canada for citizens, permanent residents, and those on work or study visas. With a Canadian SIN and mailing address, one can apply for health care coverage, driver’s licenses, and other important services.

Detection method: Checksum and (pattern match or context)

Chile Specific (optional)


CHILE_CDI_NUMBER A Chilean Cédula de Identidad (CDI), or identity card, is used as the main identity document for citizens.

China Specific (optional)


CHINA_PASSPORT Chinese passport number.

Detection method: Pattern match and context

Context:

  • China
  • Passport
  • 中华人民共和国护照
  • 护照号
  • Hùzhào hào
  • 护照

Colombia Specific (optional)


COLOMBIA_CDC_NUMBER A Colombian Cédula de Ciudadanía (CDC), or citizenship card, is used as the main identity document for citizens.

Denmark Specific (optional)


DENMARK_CPR_NUMBER A Personal Identification Number (CPR, Det Centrale Personregister) is a national ID number in Denmark. It is used with public agencies such as health care and tax authorities. Banks and insurance companies also use it as a customer number. The CPR number is required for people who reside in Denmark, pay tax or own property there.

Finland Specific (optional)


FINLAND_NATIONAL_ID_NUMBER A Finnish personal identity code, a national government identification number for Finnish citizens used on identity cards, driver’s licenses and passports.

France Specific (optional)

FRANCE_CNI The Carte Nationale d’Identité Sécurisée (CNI or CNIS) is the French national identity card. It’s an official identity document consisting of a 12-digit identification number. This number is commonly used when opening bank accounts and when paying by check. It can sometimes be used instead of a passport or visa within the European Union (EU) and in some other countries.

Detection method: Pattern match and context

Context:

  • CNI
  • CNIS (carte nationale d’identité securisée)
  • identité
  • identite
FRANCE_NIR The Numéro d’Inscription au Répertoire (NIR) is a permanent personal identification number that’s also known as the French social security number for services including healthcare as well as pensions.

Detection method: Pattern match and checksum

FRANCE_PASSPORT French passport number.

Detection method: Pattern match and context

Context:

  • France
  • Passport
  • Passeport
  • REPUBLIC FRANCAIS
  • Numéro de passeport

Germany Specific (optional)

GERMANY_PASSPORT German passport number. The format of a German passport number is 10 alphanumeric characters, chosen from numerals 0-9 and letters C, F, G, H, J, K, L, M, N, P, R, T, V, W, X, Y, Z.

Detection method: Pattern match and context

Context:

  • GERMANY
  • REISEPASS
  • PASSPORT
  • Europäische Union
  • Bundesrepublik
  • Deutschland
  • reisepassnummer

Hong Kong Specific (optional)


HONG_KONG_ID_NUMBER The 香港身份證, or Hong Kong identity card (HKIC), is used as the main identity document for citizens of Hong Kong.

India Specific (optional)

INDIA_PAN_INDIVIDUAL The Personal Permanent Account Number (PAN) is a unique 10-character alphanumeric identifier used for identification of individuals, particularly those who pay income tax. It’s issued by the Indian Income Tax Department. The PAN is valid for the lifetime of the holder.

Detection method: Pattern match and context

Context:

  • India
  • Account Number
  • PAN
  • Taxpayer ID

Indonesia Specific (optional)


INDONESIA_NIK_NUMBER An Indonesian Single Identity Number (Nomor Induk Kependudukan, or NIK) is the national identification number of Indonesia. The NIK is used as the basis for issuing Indonesian resident identity cards (Kartu Tanda Penduduk, or KTP), passports, driver’s licenses and other identity documents.

Ireland Specific (optional)


IRELAND_DRIVING_LICENSE_NUMBER An Irish driving license number.
IRELAND_EIRCODE Eircode is an Irish postal code that uniquely identifies an address.
IRELAND_PASSPORT An Irish (IE) passport number.
IRELAND_PPSN The Irish Personal Public Service Number (PPS number, or PPSN) is a unique number for accessing social welfare benefits, public services, and information in Ireland.

Israel Specific (optional)


ISRAEL_IDENTITY_CARD_NUMBER The Israel identity card number is issued to all Israeli citizens at birth by the Ministry of the Interior. Temporary residents are assigned a number when they receive temporary resident status.

Italy Specific (optional)


ITALY_FISCAL_CODE An Italian fiscal code is a unique 16-character alphanumeric code assigned to Italian citizens as a form of identification.

Japan Specific (optional)

JAPAN_INDIVIDUAL_NUMBER Sometimes referred to as “My Number,” the Japanese national identification number is a new national ID number as of January 2016.

Context:

  • 個人番号
  • マイナンバー
  • 身分証明書
  • Individual Number
  • My Number
  • Identity Card
JAPAN_PASSPORT Japanese passport number. The passport number consists of two alphabetic characters followed by seven digits.

Detection method: Pattern match and context

Context:

  • パスポート
  • パスポート番号
  • Japan
  • Passport

Korea Specific (optional)

KOREA_PASSPORT Korean passport number. There are two different formats:

  • Pre-2008 passport numbers consist of 9 characters. The first two characters are the issued local code, corresponding to the holder’s gu, or district. The remaining seven digits are the serial number.
  • Post-2008 passport numbers consist of 9 characters. The first character is either a single letter M, denoting PM passports, or the letter S for PS passports. The remaining 8 digits are the serial number.

Detection method: Pattern match and context

Context:

  • 여권
  • 대한민국
  • Passport
  • Korea
KOREA_RRN A South Korean Resident Registration Number (RRN), comparable to a social security number.

Detection method: Pattern match, checksum and context

Context:

  • 주민등록번호
  • 住民登錄番號
  • korean
  • korea
  • KSSN
  • RRN
  • resident registration
  • registration number
  • social security

Mexico Specific (optional)

MEXICO_CURP_NUMBER The Mexico Clave Única de Registro de Población (CURP) number, or Unique Population Registry Code or Personal Identification Code number. This is an 18-character state-issued identification number assigned by the Mexican government to citizens or residents of Mexico and used for taxpayer identification.

Detection method: Pattern match and context

Context:

  • CURP
  • Clave Única
  • Población
  • Registro
  • UPRC
  • Personal ID
  • Registry Code
MEXICO_PASSPORT Mexican passport number.

Detection method: Pattern match and context

Context:

  • Mexico
  • Passport
  • Pasaporte
  • México
  • Mexican

Netherlands Specific (optional)

NETHERLANDS_BSN_NUMBER A Netherlands Burgerservicenummer (BSN), or Citizen’s Service Number, is a state-issued identification number that’s on driver’s licenses, passports, and international ID cards.

Detection method: Checksum and (pattern match or context)

Context:

  • BSN
  • Personal Number
  • Burgerservicenummer
  • Netherlands
  • Identification Number
  • Service Number
  • sofinummer
  • sofi
  • personalnummer
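
The checksum referred to here is commonly known as the “11-proef” (eleven test). A minimal sketch, assuming the publicly documented weights for 9-digit BSNs and a hypothetical function name:

```python
import re

BSN_WEIGHTS = (9, 8, 7, 6, 5, 4, 3, 2, -1)


def bsn_checksum_valid(candidate: str) -> bool:
    """Weighted digit sum (last digit weighted -1) must be divisible by 11."""
    cleaned = re.sub(r"[ .-]", "", candidate)
    if not re.fullmatch(r"[0-9]{9}", cleaned):
        return False
    total = sum(int(char) * weight for char, weight in zip(cleaned, BSN_WEIGHTS))
    return total % 11 == 0
```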

New Zealand Specific (optional)


NEW_ZEALAND_IRD_NUMBER An IRD number is used in New Zealand by the government, financial institutions, and employers to identify an entity for tax-related events. Each entity is assigned one IRD number by New Zealand’s Inland Revenue Department.

Norway Specific (optional)


NORWAY_NI_NUMBER Norway’s Fødselsnummer, National Identification Number, or Birth Number is assigned at birth, or on migration into the country. It is registered with the Norwegian Tax Office.

Paraguay Specific (optional)


PARAGUAY_CIC_NUMBER A Paraguayan Cédula de Identidad Civil (CIC), or civil identity card, is used as the main identity document for citizens.

Peru Specific (optional)


PERU_DNI_NUMBER A Peruvian Documento Nacional de Identidad (DNI), or national identity card, is used as the main identity document for citizens.

Poland Specific (optional)


POLAND_PESEL_NUMBER The PESEL number is the national identification number used in Poland. It is mandatory for all permanent residents of Poland, and for temporary residents staying there longer than 2 months. It is assigned to just one person and cannot be changed.
POLAND_NATIONAL_ID_NUMBER The Polish identity card number is a government identification number for Polish citizens. Every citizen older than 18 years must have an identity card. The local Office of Civic Affairs issues the card, and each card has its own unique number.
POLAND_PASSPORT A Polish passport number. A Polish passport is an international travel document for Polish citizens. It can also be used as proof of Polish citizenship.

Portugal Specific (optional)


PORTUGAL_CDC_NUMBER A Portuguese Cartão de cidadão (CDC), or Citizen Card, is used as the main identity, Social Security, health services, taxpayer, and voter document for citizens.

Singapore Specific (optional)


SINGAPORE_NATIONAL_REGISTRATION_ID_NUMBER A unique set of nine alpha-numeric characters on the Singapore National Registration Identity Card.
SINGAPORE_PASSPORT A Singaporean passport number.

South Africa Specific (optional)


SOUTH_AFRICA_ID_NUMBER A South Africa ID number.

Spain Specific (optional)

SPAIN_NIE_NUMBER The Número de Identificación de Extranjeros (NIE) is an identification number for foreigners living or doing business in Spain. An NIE number is needed for key transactions such as opening a bank account, buying a car, or setting up a mobile phone contract.

Detection method: Checksum and (pattern match or context)

Context:

  • Número de Identificación de Extranjeros
  • NIE
SPAIN_NIF_NUMBER The Número de Identificación Fiscal (NIF) is a government identification number for Spanish citizens. An NIF number is needed for key transactions such as opening a bank account, buying a car, or setting up a mobile phone contract.

Detection method: Checksum and (pattern match or context)

Context:

  • Número de Identificación Fiscal
  • NIF
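
For both NIE and NIF numbers, the publicly documented checksum is a control letter derived from the number modulo 23. The sketch below assumes that scheme and covers only the common individual formats (eight digits plus a letter for the NIF, and X, Y or Z plus seven digits plus a letter for the NIE); the function name is hypothetical.

```python
import re

CONTROL_LETTERS = "TRWAGMYFPDXBNJZSQVHLCKE"
NIE_PREFIX = {"X": "0", "Y": "1", "Z": "2"}


def spain_id_checksum_valid(candidate: str) -> bool:
    """Check the trailing control letter of a NIF or NIE."""
    value = re.sub(r"[ -]", "", candidate).upper()
    if re.fullmatch(r"[XYZ][0-9]{7}[A-Z]", value):
        # NIE: the leading letter stands for a digit before the mod-23 step.
        number = int(NIE_PREFIX[value[0]] + value[1:8])
    elif re.fullmatch(r"[0-9]{8}[A-Z]", value):
        # NIF in its eight-digit form.
        number = int(value[:8])
    else:
        return False
    return CONTROL_LETTERS[number % 23] == value[-1]
```
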
SPAIN_PASSPORT A Spanish Ordinary Passport (Pasaporte Ordinario) number. There are 4 different types of passports in Spain. This detector is for the Ordinary Passport (Pasaporte Ordinario) type, which is issued for ordinary travel, such as vacations and business trips.

Detection method: Pattern match and context

Context:

  • Passport
  • Pasaporte
  • Espana
  • España
  • Spain

Sweden Specific (optional)


SWEDEN_NATIONAL_ID_NUMBER A Swedish Personal Identity Number (personnummer), a national government identification number for Swedish citizens.
SWEDEN_PASSPORT A Swedish passport number.

Taiwan Specific (optional)


TAIWAN_PASSPORT A Taiwanese passport number.

Thailand Specific (optional)


THAILAND_NATIONAL_ID_NUMBER The Thai บัตรประจำตัวประชาชนไทย, or identity card, is used as the main identity document for Thai nationals.

Turkey Specific (optional)


TURKEY_ID_NUMBER A unique Turkish personal identification number, assigned to every citizen of Turkey.

United Kingdom Specific (optional)

UK_DRIVERS_LICENSE_NUMBER A driver’s license number for the United Kingdom of Great Britain and Northern Ireland (UK).

Detection method: Pattern match

UK_NATIONAL_HEALTH_SERVICE_NUMBER A National Health Service (NHS) number is the unique number allocated to a registered user of the three public health services in England, Wales, and the Isle of Man.

Detection method: Pattern match and checksum
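
The NHS number checksum is publicly documented as a modulus-11 scheme over the first nine digits. A minimal sketch under that assumption, with a hypothetical function name:

```python
import re


def nhs_checksum_valid(candidate: str) -> bool:
    """Verify the tenth digit against the weighted sum of the first nine."""
    cleaned = re.sub(r"[ -]", "", candidate)
    if not re.fullmatch(r"[0-9]{10}", cleaned):
        return False
    digits = [int(char) for char in cleaned]
    # Weights run from 10 down to 2 across the first nine digits.
    total = sum(d * w for d, w in zip(digits[:9], range(10, 1, -1)))
    check = 11 - (total % 11)
    if check == 11:
        check = 0
    # A computed value of 10 means the number is never valid.
    return check != 10 and check == digits[9]
```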

UK_NATIONAL_INSURANCE_NUMBER The National Insurance number (NINO) is a number used in the United Kingdom (UK) in the administration of the National Insurance or social security system. It identifies people, and is also used for some purposes in the UK tax system. The number is sometimes referred to as NI No or NINO.

Detection method: Pattern match (with delimiters) or pattern match and context words

UK_PASSPORT United Kingdom (UK) passport number.

Detection method: Pattern match and context

Context:

  • United Kingdom
  • Passport
  • Travel Document
UK_TAXPAYER_REFERENCE A United Kingdom (UK) Unique Taxpayer Reference (UTR) number. This number, a string of 10 decimal digits, is an identifier used by the UK government to manage the taxation system. Unlike other identifiers, such as the passport number or social insurance number, the UTR is not listed on official identity cards.

Detection method: Pattern match and context

Context:

  • United Kingdom
  • Taxpayer
  • UTR

United States Specific (optional)

AMERICAN_BANKERS_CUSIP_ID A Committee on Uniform Security Identification Procedures (CUSIP) number is a 9-character alphanumeric code that identifies a North American financial security.

Detection method: Checksum or context (when check digit not present)

Context: CUSIP

US_ADOPTION_TAXPAYER_IDENTIFICATION_NUMBER An Adoption Taxpayer Identification Number (ATIN) is a type of Tax Identification Number (TIN), issued by the Internal Revenue Service (IRS) to individuals who are in the process of legally adopting a US citizen or resident child.

Detection method: Pattern match or 9 digits with context

Context:

  • SSN
  • Social
  • Social Security
  • Taxpayer
  • Taxpayer ID
  • Taxpayer identification
  • Tax ID
  • Tax identification
  • ATIN
  • TIN
  • Pending TIN
  • Adoption
  • Adoptions
  • Pending US adoption
  • Pending US adoptions
US_BANK_ROUTING_MICR The American Bankers Association (ABA) Routing Number (also called the transit number) is a nine-digit code. It identifies the financial institution that is responsible for paying, or entitled to receive credit for, a check or electronic transaction.

Detection method: Checksum on 9 digits

Context:

  • ABA
  • routing
  • transit
  • bank
  • banking
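
The nine-digit checksum can be sketched with the publicly documented 3-7-1 weighting for ABA routing numbers. The function name is hypothetical and the sketch ignores the context words.

```python
import re

ABA_WEIGHTS = (3, 7, 1, 3, 7, 1, 3, 7, 1)


def aba_checksum_valid(candidate: str) -> bool:
    """Weighted digit sum must be divisible by 10."""
    cleaned = candidate.strip()
    if not re.fullmatch(r"[0-9]{9}", cleaned):
        return False
    total = sum(int(char) * weight for char, weight in zip(cleaned, ABA_WEIGHTS))
    return total % 10 == 0
```
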
US_DEA_NUMBER A Drug Enforcement Administration (DEA) number is assigned to a health care provider by the US DEA. It allows the health care provider to write prescriptions for controlled substances. The DEA number is often used as a general “prescriber number” that is a unique identifier for anyone who can prescribe medication.

Detection method: Pattern match and checksum
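
A DEA number is commonly described as two leading characters followed by seven digits, the last of which is a check digit. The sketch below assumes that layout; the pattern is deliberately loose about which leading letters are allowed, and the function name is hypothetical.

```python
import re

# Assumed pattern: a letter, then a letter or "9", then seven digits.
DEA_PATTERN = re.compile(r"[A-Z][A-Z9][0-9]{7}")


def dea_checksum_valid(candidate: str) -> bool:
    """Pattern match, then compare the seventh digit with the computed check digit."""
    value = candidate.strip().upper()
    if not DEA_PATTERN.fullmatch(value):
        return False
    d = [int(char) for char in value[2:]]
    # Sum of digits 1, 3, 5 plus twice the sum of digits 2, 4, 6.
    total = (d[0] + d[2] + d[4]) + 2 * (d[1] + d[3] + d[5])
    return total % 10 == d[6]
```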

US_EMPLOYER_IDENTIFICATION_NUMBER An Employer Identification Number (EIN) is also known as a Federal Tax Identification Number, and is used to identify a business entity.

Detection method: Pattern match or 9 digits with context

Context:

  • employer
  • patronal
  • ein
US_HEALTHCARE_NPI The National Provider Identifier (NPI) is a unique 10-digit identification number issued to health care providers in the United States by the Centers for Medicare and Medicaid Services (CMS). The NPI has replaced the unique provider identification number (UPIN) as the required identifier for Medicare services. It’s also used by other payers, including commercial healthcare insurers.

Detection method: Checksum on 10 digits
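
The NPI check digit is publicly documented as a Luhn check computed over the number with the prefix 80840 prepended. A minimal sketch under that assumption, with hypothetical function names:

```python
import re


def luhn_valid(digits: str) -> bool:
    """Luhn check: double every second digit from the right, sum, test mod 10."""
    total = 0
    for index, char in enumerate(reversed(digits)):
        value = int(char)
        if index % 2 == 1:
            value *= 2
            if value > 9:
                value -= 9
        total += value
    return total % 10 == 0


def npi_checksum_valid(candidate: str) -> bool:
    """Apply the Luhn check to the 10-digit NPI prefixed with 80840."""
    cleaned = candidate.strip()
    if not re.fullmatch(r"[0-9]{10}", cleaned):
        return False
    return luhn_valid("80840" + cleaned)
```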

US_INDIVIDUAL_TAXPAYER_IDENTIFICATION_NUMBER An Individual Taxpayer Identification Number (ITIN) is a type of Tax Identification Number (TIN), issued by the Internal Revenue Service (IRS). An ITIN is a tax processing number only available for certain nonresident and resident aliens, their spouses, and dependents who cannot get a Social Security Number (SSN).

Detection method: Pattern match or 9 digits with context

Context:

  • SSN
  • Social
  • Social Security
  • Taxpayer
  • Taxpayer ID
  • Tax ID
  • Tax identification
  • ITIN
  • TIN
  • Individual Tax Identification Number
US_PASSPORT United States passport number.

Detection method: Pattern match and context

Context:

  • United States
  • USA
  • Passport
  • Travel
  • Document
US_PREPARER_TAXPAYER_IDENTIFICATION_NUMBER A Preparer Taxpayer Identification Number (PTIN) is an identification number that all paid tax return preparers must use on US federal tax returns or claims for refund submitted to the Internal Revenue Service (IRS).

Detection method: Pattern match and context

Context:

  • SSN
  • Social
  • Social Security
  • Taxpayer
  • Taxpayer ID
  • Taxpayer identification
  • Tax ID
  • Tax identification
  • PTIN
  • TIN
  • Preparer Taxpayer Identification Number
US_SOCIAL_SECURITY_NUMBER A United States Social Security number (SSN) is a 9-digit number issued to US citizens, permanent residents, and temporary residents. The Social Security number has effectively become the United States national identification number.

Detection method: Pattern match or 9 digits with context

Context:

  • SSN
  • Social
  • Social Security
  • Taxpayer
  • Taxpayer ID
  • Taxpayer identification
US_VEHICLE_IDENTIFICATION_NUMBER A vehicle identification number (VIN) is a unique 17-character code assigned to every on-road motor vehicle.

Detection method: Checksum and pattern match

Context:

  • VIN
  • Vehicle Identification Number
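
The VIN checksum used in North America places a check digit in position 9. The sketch below assumes the publicly documented letter values and position weights; the function name is hypothetical.

```python
import re

# Letter values used by the check-digit calculation (I, O and Q never appear in a VIN).
TRANSLITERATION = dict(zip(
    "ABCDEFGHJKLMNPRSTUVWXYZ",
    (1, 2, 3, 4, 5, 6, 7, 8, 1, 2, 3, 4, 5, 7, 9, 2, 3, 4, 5, 6, 7, 8, 9),
))
WEIGHTS = (8, 7, 6, 5, 4, 3, 2, 10, 0, 9, 8, 7, 6, 5, 4, 3, 2)


def vin_checksum_valid(candidate: str) -> bool:
    """Pattern match on 17 allowed characters, then verify the position-9 check digit."""
    vin = candidate.strip().upper()
    if not re.fullmatch(r"[A-HJ-NPR-Z0-9]{17}", vin):
        return False
    total = sum(
        (int(char) if char.isdigit() else TRANSLITERATION[char]) * weight
        for char, weight in zip(vin, WEIGHTS)
    )
    remainder = total % 11
    expected = "X" if remainder == 10 else str(remainder)
    return vin[8] == expected
```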

Uruguay Specific (optional)


URUGUAY_CDI_NUMBER A Uruguayan Cédula de Identidad (CDI), or identity card, is used as the main identity document for citizens.

Venezuela Specific (optional)


VENEZUELA_CDI_NUMBER A Venezuelan Cédula de Identidad (CDI), or national identity card, is used as the main identity document for citizens.