EDES-IT

Services - Hardware Security Modules (HSMs): What They Are, Why They’re Used, and How to Use Them Securely

1. What is an HSM?

A Hardware Security Module (HSM) is a dedicated, tamperresistant hardware device designed to:

Generate cryptographic keys (e.g., RSA, ECC, AES)
Store keys securely so they never leave the device in plaintext
Perform cryptographic operations (encryption, decryption, signing, verification, key wrapping, etc.) inside a protected environment

You can think of an HSM as a “cryptographic vault” with a CPU: it both safeguards keys and uses them, without exposing the raw key material to the rest of the system.

Common HSM form factors:

Networkattached HSMs (appliances accessed over the network)
PCIe HSMs (cards inside servers, especially in data centers)
Cloud HSMs (HSMs managed by cloud providers, e.g., AWS CloudHSM, Azure Dedicated HSM, GCP Cloud HSM)
USB/tokenstyle devices (smaller, often for personal or smallscale key protection, sometimes called smartcards or security tokens)

Typical use cases:

TLS/HTTPS server key protection
Certificate Authorities (CAs) and PKI key management
Payment systems (e.g., card PIN processing, EMV, HSMs for PCIDSS compliance)
Code signing (signing binaries, containers, updates)
Database and storage encryption key protection
Hardwarebacked identity and authentication (e.g., signing tokens, SAML assertions, JWTs)

2. What is Achieved by Using HSMs?

Using an HSM is about improving the security and assurance around cryptographic keys and operations. Key benefits:

2.1 Stronger Key Protection

Keys never appear in plaintext in system memory (RAM) or on disk.
Keys are generated and stored inside the HSM; the host only gets:
- Public keys, and/or
- Cryptographic results (e.g., a signature or ciphertext).
Even if an attacker compromises the server OS or hypervisor, extracting private keys from the HSM is significantly harder (sometimes practically infeasible).

2.2 Tamper Resistance and Tamper Evidence

HSMs are built to resist physical attacks (opening casing, probing, voltage or temperature glitches).
Many comply with standards like:
- FIPS 1402 / 1403 (Levels 2–4)
- Common Criteria
Attempted tampering can trigger:
- Zeroization (erasing all keys)
- Evident logs or alarms

2.3 Strong Access Control and Role Separation

Access to keys is controlled via:
- PINs/passwords
- Smartcards
- Multifactor or quorumbased access (e.g., MofN key custodians)
Roles often include:
- Security Officer / Crypto Officer – configures HSM, policies
- Crypto User / Application – uses keys but cannot change policies
This supports separation of duties, a key security principle.

2.4 Auditable, HighAssurance Crypto Operations

Operations can be logged by the HSM (subject to configuration).
This contributes to regulatory and compliance requirements:
- PCIDSS (payments)
- eIDAS, ETSI standards (qualified signatures)
- Industryspecific regulations (healthcare, government, etc.)

2.5 Performance and Offloading

HSMs often include hardware accelerators for highvolume operations:
- TLS termination at scale
- Batch signing (e.g., for software artifacts, documents)
Offloading crypto from the main CPU can improve performance and latency.

3. How HSMs Are Typically Used Operationally

At a high level, using an HSM often follows this pattern:

3.1 Provision the HSM

Initialize the device with an HSM Security Officer (SO) PIN or credential.
Set security policies (e.g., which algorithms, key lengths, access controls).
Configure backups and high availability (HA clusters if needed).

3.2 Create or Import Keys

Generate keys inside the HSM:
- Asymmetric keys (RSA, ECC) used for TLS, signing, etc.
- Symmetric keys (AES) for data encryption.
Or import keys securely:
- Typically via wrapped keys or key encryption keys (KEKs).
- Never import raw private keys over insecure channels.

3.3 Integrate Applications

Applications integrate using:
- PKCS#11 (most common API for HSMs)
- CNG / KSP (Windows)
- JCE/JCA providers (Java)
- Vendorspecific SDKs and REST APIs
The app asks the HSM to:
- Sign data
- Decrypt data
- Generate random numbers
- Derive keys
The private keys remain inside; the app only gets back results.

3.4 Backup and HA

Keys are backed up to:
- Another HSM (clone)
- HSMprotected backup tokens or files
HSMs are often deployed in clusters so one failure doesn’t take down the service.

3.5 Monitoring and Auditing

Monitor:
- Error rates, performance, utilization
- Unauthorized access attempts
Audit:
- Who did what and when (admin actions, key creations, policy changes)

4. Operating an HSM Securely

Secure use of an HSM is not just about buying the hardware; it’s about process, configuration, and governance. Key best practices:

4.1 Security Policy and Governance

Define and document:
- Which applications can use which keys.
- Required key sizes and algorithms (e.g., RSA 3072/4096, ECDSA P256, Ed25519, AES256GCM).
- How keys are backed up, rotated, and destroyed.
Enforce least privilege:
- Each application or user only sees the keys it needs.
- Use separate key containers/partitions or logical HSMs per system/tenant.

4.2 Strong Authentication and Access Control

Require multifactor authentication for HSM admin accounts where possible.
Use quorum/MofN schemes for critical operations:
- For example, 2 of 4 security officers must be present to initialize a new partition or restore a backup.
Do not embed HSM PINs or passwords directly into source code:
- Use secret management tools or OS keyrings.
- Restrict OS access to configuration files containing sensitive references.

4.3 Key Management Lifecycle

Treat key lifecycle as a process:

Generation
- Prefer generating keys on the HSM.
- Use HSM hardware RNG or NISTapproved DRBG seeded by hardware entropy.
Distribution / Use
- Do not export private keys in plaintext.
- If keys must be moved (e.g. between HSMs), use secure key wrapping:
  - Key encryption keys inside HSMs.
  - Manual, controlled procedures, often with multiple custodians.
Rotation
- Define rotation timelines based on:
  - Regulatory obligations
  - Risk (exposure level, key usage)
- Implement automated or semiautomated renewal:
  - New key pair → new certificate → phased cutover.
Revocation and Destruction
- When decommissioning:
  - Revoke certificates (CRLs or OCSP).
  - Zeroize keys in the HSM (and in any backups) in a controlled and auditable way.

4.4 Secure Integration in Applications

Use standard APIs (PKCS#11, JCE, etc.) and welltested client libraries.
Separate:
- HSM admin credentials (for provisioning, key management)
- Application credentials (for using keys)
Validate that the application:
- Actually uses the HSM for the intended operations (e.g., TLS private key operations go through the HSM, not a software key).
- Handles errors gracefully, avoids logging sensitive info.

4.5 Physical Security and Environment

Place HSMs in secure facilities:
- Locked racks or cages in controlled data centers.
- Camera coverage, visitor logs, access control systems.
Protect HSM admin consoles and management networks:
- Management interfaces should be on restricted network segments.
- Use jump hosts / bastion servers with strong access controls.

4.6 Logging, Monitoring, and Incident Response

Enable and centralize:
- HSM logs: policy changes, login attempts, tamper events.
- Host and application logs: HSM errors, usage patterns.
Monitor for:
- Repeated authentication failures
- Unexpected key creation/deletion events
- Sudden spikes in cryptographic requests
Have a documented incident response playbook:
- HSM tamper event or suspected key compromise:
  - Lock down HSM or affected partitions.
  - Revoke associated certificates/keys.
  - Rotate to new keys and restore from knowngood backups if appropriate.

4.7 Compliance and Testing

Align HSM operations with:
- FIPS 140 level requirements (if applicable)
- PCIDSS, eIDAS, or other industry standards
Periodically:
- Pentest and redteam HSM usage (at least at the integration level).
- Run functional tests to ensure failover, backup/restore, and key rotation procedures work as expected.

5. Common Pitfalls to Avoid

Using the HSM only as a fancy key store but still exporting private keys to software.
Sharing HSM admin credentials among team members instead of using individual, auditable accounts
No backup strategy:
- Single HSM with critical keys and no backup → single point of failure.
Overpermissive key access policies:
- “One app partition for everything” increases blast radius.
Poor monitoring:
- No alerting on failed logins, tamper events, or unusual key operations.

6. Practical Example: HSM for TLS Key Protection

A typical secure deployment might look like:

Deploy a network HSM or cloud HSM.
Initialize it:
- Set SO credentials, define policies.
Generate a TLS key pair on the HSM (e.g., RSA 3072 or ECDSA P256).
Create a Certificate Signing Request (CSR) on the HSM.
Send CSR to your CA → receive certificate.
Configure your web servers or load balancer:
- Point them to the HSM via PKCS#11 / vendor plugin.
Verify:
- The server’s TLS private key operations (handshakes) are performed by the HSM.
Add:
- HA configuration (multiple HSM instances)
- Monitoring and alerting.