Image by https://unsplash.com/@jankolar
An Introduction to WHOIS
Last updated: May 21, 2025
Introduction
This section will focus on WHOIS, a long-standing internet protocol used to query databases for information about the registration of domain names (and other internet resources like IP addresses, though our focus here is domains). Think of it as a way to ask, "Who is responsible for this domain name?"
The data you get from a WHOIS query can include details like who registered the domain, the registrar they used, important dates related to the registration, the domain's current status, and the nameservers it points to. Like many things on the internet, there are variations and exceptions, especially with different types of domains, but we'll cover the general principles.
Of Note: GDPR's Profound Impact on WHOIS
Before diving deeper, it's crucial to understand how a major piece of legislation dramatically changed the WHOIS landscape.
Prior to the European Union's General Data Protection Regulation (GDPR), which went into effect in May 2018, public WHOIS records were often a "gold mine" of personal information. Unless the domain registrant (the person or entity registering the domain) used a privacy or proxy service, details such as their name, address, phone number, and email address were frequently publicly viewable. ICANN (Internet Corporation for Assigned Names and Numbers) policies required registrars to make efforts to ensure the accuracy of this data, with specific validation steps often focused on email addresses. However, much of this personal information was readily available.
When GDPR was enacted, it imposed strict rules on handling the personal data of individuals in the EU. This had a global ripple effect on WHOIS. The privacy services that people once paid for (or sometimes received as a free add-on from their registrar) quickly started to become the default for many gTLDs (generic Top-Level Domains).
Why the shift to widespread redaction of personal information? Registrars worldwide had to consider:
- Investing heavily in complex systems to determine who was covered by GDPR and manage data access accordingly (often called "Know Your Customer" or KYC programs in other contexts).
- Risking non-compliance with GDPR by continuing to publish personal data.
- The simpler, safer option: Redacting most personal information from public WHOIS outputs for gTLDs by default, regardless of where the registrant lived.
As a result, much of the easily accessible personal information vanished from public WHOIS records for gTLDs. While this has diminished the value of WHOIS for those who relied on PII (Personally Identifiable Information) for outreach or investigation, the remaining structured registration data (like dates, registrar details, and domain status) is still very important for technical operations, security research, and tracking domain activity.
WHOIS Fields and Their Meanings (Post-GDPR gTLD Context)
Even with redaction, several key pieces of information are generally available and reliable in a WHOIS record for a gTLD. Here are some common non-redacted fields:
Field | Meaning |
---|---|
Registrar | The ICANN-accredited organization through which the domain name was registered (e.g., GoDaddy, Namecheap, Tucows). Registration-related issues (like renewals or transfers) are handled here. They also have abuse contacts. |
Creation Date | The date when the domain name was first registered. |
Expiration Date | The date when the current registration period for the domain is set to expire. If not renewed, it may eventually become available for others to register (after various grace and redemption periods). |
Updated Date | The date when the domain's WHOIS record was last modified. This could be due to renewal, nameserver changes, status updates, or contact information edits (even if redacted). |
Domain Status | These are EPP (Extensible Provisioning Protocol) status codes (e.g., ok , clientTransferProhibited , redemptionPeriod ). They indicate the current state of the domain, which determines what actions can be performed on it. For example, clientTransferProhibited prevents unauthorized transfers. |
Nameservers | The authoritative DNS servers for the domain. These servers hold the actual DNS records (like A, MX, CNAME records) that tell other computers where to find the website, mail servers, etc., for that domain. |
Important Note on ccTLDs (Country Code Top-Level Domains):
The rules and practices described above primarily apply to gTLDs (like .com
, .org
, .net
). ccTLDs (like .uk
, .de
, .jp
) are managed by individual country-specific registries. These registries set their own WHOIS policies. Some may provide extensive public WHOIS data, some may offer very limited information, some may have their own consent mechanisms, and some may not have a public WHOIS service at all. The impact of GDPR also varies across ccTLDs.
WHOIS Access: Traditional Port 43 vs. Modern RDAP
There are primarily two ways to perform a WHOIS lookup:
-
Port 43 WHOIS:
- This is the original, legacy protocol for WHOIS lookups. It's been around almost as long as domain names themselves.
- It operates over TCP port 43.
- Challenge: The format of the data returned via Port 43 can vary significantly between different registrars and registries. Fields might be named differently, appear in a different order, or have slightly different interpretations, making automated parsing difficult.
- It's a plain text protocol, which can sometimes be less efficient.
-
Registration Data Access Protocol (RDAP):
- RDAP is a newer, standardized protocol designed to replace the legacy Port 43 WHOIS. ICANN requires all gTLD registries to support RDAP. Many ccTLDs are also adopting it.
- It uses HTTP(S) (the same protocol your web browser uses), making it more web-friendly.
- Advantage: RDAP provides data in a consistent, predictable, structured format (JSON – JavaScript Object Notation), which is much easier for software to parse and process.
- It also offers better support for internationalized domain names (IDNs), more secure access to data, and more informative error messages.
- While the specific web address (URL or "endpoint") to query RDAP will differ for each registry, the structure of the returned data is standardized.
Which to Use?
- For gTLDs, RDAP is the modern standard and generally preferred due to its structured output.
- When investigating a domain, especially a gTLD, checking both Port 43 and RDAP (if available from your WHOIS client or online tool) can sometimes yield slightly different presentations or additional context during this transitional period.
- For ccTLDs, you might need to start with a Port 43 lookup, as RDAP adoption is not yet universal. Many ccTLD registries provide web-based WHOIS lookups on their own websites.
Understanding WHOIS, its current limitations, and the available data is a key skill for anyone working with domain names, whether for administration, cybersecurity, or general internet research.