Penetration Testing Standards: PTES, OSSTMM, NIST, and OWASP

Shubham JhaSeptember 12, 20245 min read

Authors

Shubham Jha

TL;DR

✓The four most-used penetration testing standards are PTES, OSSTMM, NIST SP 800-115, and the OWASP testing guides.
✓PTES is the most complete end-to-end engagement framework, covering pre-engagement through reporting.
✓OSSTMM is a rigorous, metrics-driven methodology that produces a measurable RAV score.
✓NIST SP 800-115 is the go-to reference for US government and regulated environments.
✓OWASP WSTG, MASVS, and the API Top 10 are the application-layer standards testers map findings to.

The fastest way to expose a weak pentest vendor is one question: which standard do you follow, and how do you cover what it leaves out? A strong answer names two or three and explains how they stack. A vague answer ("we use our own methodology") tells you the test will be ad hoc, and coverage will depend on whichever bug the tester happened to find interesting that week. We have inherited reports from exactly that kind of vendor: ten pages of nmap output, no test IDs, no mapping, no way to tell what was actually checked.

Professional testing is anchored to published standards so engagements are repeatable, defensible, and comparable. The four you will meet are PTES, OSSTMM, NIST SP 800-115, and the OWASP guides, and they are not competitors. They are different lenses: engagement process, measurable rigor, regulatory baseline, and application coverage. This guide shows what each does, how they combine, and how to read a report that uses them.

Table of contents

Why do penetration testing standards matter?
What is PTES?
What is OSSTMM?
What is NIST SP 800-115?
OWASP is where the application-layer depth lives
How do testers stack these standards?

Why do penetration testing standards matter?

Standards make testing repeatable and comparable. Without a methodology, two testers attacking the same target could deliver wildly different results and you would have no way to judge quality. A standard defines what gets tested, in what order, and to what depth, so coverage is not left to luck.

They also support compliance and protect you. Frameworks like SOC 2 and PCI DSS expect testing aligned to a recognized methodology, and a report that maps findings to a named standard is far easier to defend in an audit. When a finding cites a specific test ID, an engineer can look up exactly what was checked and how. That traceability is the difference between a report and a receipt.

There is a second, quieter benefit: standards tell you what was not tested. A report mapped to the WSTG can show which test IDs passed, which failed, and which were out of scope, so a quiet area reads as deliberately untested rather than silently skipped. Without a methodology you have no idea whether a clean section means secure or never looked at. That distinction is exactly what an auditor, and an honest engineering lead, needs to know.

Penetration testing standards compared

Standard	Focus	Best for
PTES	Full engagement lifecycle	Structuring any end-to-end test
OSSTMM	Measurable, metrics-driven testing	Defensible, comparable scores (RAV)
NIST SP 800-115	Policy-level testing baseline	Government and regulated industries
OWASP WSTG / MASVS / API Top 10	Application-layer coverage	Web, mobile, and API technical depth

What is PTES?

PTES, the Penetration Testing Execution Standard, is the most complete end-to-end engagement framework. It defines seven sections: pre-engagement interactions, intelligence gathering, threat modeling, vulnerability analysis, exploitation, post-exploitation, and reporting. It is the standard most often mapped to the classic penetration testing phases. In practice you do not follow PTES line by line; you use its seven sections as a table of contents so nothing, especially scoping and reporting, gets skipped under time pressure.

PTES is strong because it covers the whole lifecycle, including the business-side parts other standards skip entirely. The pre-engagement section alone, scope, rules of engagement, authorization, is where most engagements actually succeed or fail, and it is the part ad hoc testers skip under time pressure. Its separate technical guidelines recommend concrete tools per phase. The honest caveat: PTES has not seen major formal updates in years, so testers treat it as a structural backbone and layer current technique from OWASP and MITRE ATT&CK on top. If you want one framework to structure a full engagement, PTES is usually it, and it maps directly to the kickoff work in how to prepare for a penetration test.

What is OSSTMM?

OSSTMM, the Open Source Security Testing Methodology Manual from ISECOM, is a rigorous, metrics-driven approach focused on measurable operational security. Instead of a vulnerability checklist, it defines how to test across channels (physical, wireless, telecom, data networks, human) and produces a quantified RAV (Risk Assessment Value) you can track over time. A simplified RAV summary in a report reads like this:

Channel: Data Networks
  Visibility   12   Access  4   Trust  3   (porosity)
  Controls present: 9 of 14 expected
  RAV (actual security): 81.4 / 100
  ^ a number you can compare against last quarter, not a vibe

Its strength is scientific rigor and repeatability: results are measured, not just described, which makes it popular where you need defensible, comparable metrics across years or business units. A board that wants to know did our security posture improve since last year gets a real answer from a RAV trend line, where a vulnerability count alone is meaningless (fewer findings might just mean a shallower test). The tradeoff is that it is dense and less prescriptive about modern web tooling, so testers pair it with OWASP for application depth. Few commercial app tests run pure OSSTMM, but its measurement discipline shows up in good reports everywhere, and the channels concept (testing physical, wireless, telecom, and human alongside data networks) is a useful reminder that your attack surface is wider than your web app.

What is NIST SP 800-115?

NIST SP 800-115, the Technical Guide to Information Security Testing and Assessment, is the US government's reference for security testing. It defines four phases (planning, discovery, attack, reporting) and covers review techniques, target identification, and vulnerability validation at a high level. It is the go-to baseline for federal agencies, contractors, and regulated industries because it is authoritative and widely accepted by auditors.

It is deliberately less granular than PTES on technique, acting more as a policy-level standard you align to than a step-by-step playbook. Many engagements cite NIST for the compliance signature while using PTES or OWASP for execution detail. Put bluntly: if an auditor asks what standard, NIST is the safe answer; if a tester asks what do I actually run, it is not.

This is why teams get confused when they read a NIST-aligned report and find it thin on technique. NIST was never meant to be a technical checklist; it is the common vocabulary that lets a federal auditor, a contractor, and a vendor agree they ran a legitimate assessment. Expecting it to tell a tester which sqlmap flags to use is like expecting a building code to tell a carpenter how to swing a hammer. Pair it with OWASP for the swing.

OWASP is where the application-layer depth lives

OWASP publishes the application-layer standards testers map findings to. The Web Security Testing Guide (WSTG) is the definitive checklist for web apps; MASVS and the MSTG cover iOS and Android; and the API Security Top 10 covers REST and GraphQL, including broken object-level authorization (BOLA). These guides stay current in a way engagement frameworks do not. The API Top 10 added server-side request forgery as its own category in the 2023 revision, which tells you where attacker attention moved.

The payoff is traceability. Every WSTG test has an ID, and a strong report cites it next to each finding so the reader knows exactly what was checked:

WSTG-ATHZ-02  Testing for Privilege Escalation       FAIL  (IDOR, High)
WSTG-ATHZ-01  Testing Directory Traversal / File Incl PASS
WSTG-SESS-03  Testing for Session Fixation           PASS
WSTG-INPV-05  Testing for SQL Injection              FAIL  (blind, Critical)

A typical web test uses PTES for the overall process and the OWASP WSTG for the technical checklist, with each test ID traceable in the report. Mapping findings to OWASP categories also makes them easier for developers to act on, because they already think in those terms.

A subtle strength of the OWASP guides is that they evolve with the attack surface, where engagement frameworks do not. The API Top 10 promoting server-side request forgery to its own category in 2023, and the OWASP Top 10 adding broken access control as the number-one web risk, are not editorial whims; they track where real breaches moved. A tester who maps to the current edition is implicitly testing for what attackers are doing now, not what mattered five years ago. This is also why pairing a static engagement framework like PTES with a living coverage standard like the WSTG works so well: PTES gives you durable structure, OWASP gives you current technique.

How to vet a vendor's methodology

Ask

✓Which standards do you follow and why those?
✓Can you show a redacted report with test IDs mapped?
✓How do you cover what your main standard leaves out?

Red flags

✓We use our own proprietary methodology, no detail
✓Findings rated by raw CVSS with no exploitability context
✓Report is raw scanner output with no WSTG / ATT&CK mapping

How do testers stack these standards?

Testers rarely use one standard in isolation; they stack them by what each does best:

PTES        -> engagement shape (scope to report)
OWASP WSTG  -> per-surface technical checklist (web/API/mobile)
MITRE ATT&CK -> map the attack chain to adversary techniques
NIST 800-115 -> the language auditors recognize

In practice a strong web engagement reads as: PTES process, WSTG coverage, ATT&CK mapping for the attack chain, NIST cited for compliance. That stacking is why a structured penetration testing process beats an ad hoc one, and the table below summarizes which standard to reach for when. If you are keeping coverage current between annual tests, agentic pentesting applies the same standards-based methodology continuously rather than once a year.

Strobes insight

No single standard does it all. The pros use PTES for process, OWASP for app-layer depth, ATT&CK for the chain, and NIST for the compliance signature. If a vendor cites only one, ask how they cover the rest.

Frequently asked questions

What is the best penetration testing standard?

There is no single best one. PTES is the most complete engagement framework, OWASP WSTG is best for web app depth, NIST SP 800-115 suits regulated environments, and OSSTMM offers measurable rigor. Most professional tests combine several.

What is the difference between PTES and OWASP?

PTES is a full engagement lifecycle framework covering scoping through reporting. OWASP guides like the WSTG are application-layer coverage standards. Testers commonly use PTES for overall process and OWASP for the technical web or API checklist.

Is NIST SP 800-115 required for compliance?

It is not always strictly required, but it is the authoritative reference for US government and regulated environments, and auditors widely accept testing aligned to it. Many engagements cite NIST for compliance while using PTES or OWASP for execution detail.

What does OSSTMM measure?

OSSTMM produces a RAV (Risk Assessment Value), a quantified score of an environment's operational security across channels like physical, wireless, and data networks. Its emphasis is measurable, repeatable results rather than a simple vulnerability list.

Which standard covers API testing?

The OWASP API Security Top 10 is the primary standard for API penetration testing, covering issues like broken object-level authorization (BOLA). Testers pair it with PTES for overall engagement structure.

How does MITRE ATT&CK relate to these standards?

ATT&CK is not a testing standard but a knowledge base of adversary tactics and techniques. Testers map post-exploitation behavior to ATT&CK so defenders can tie each finding to a specific detection gap, complementing PTES and OWASP.

Sources and references

Shubham Jha

Security Researcher, Strobes

Shubham Jha leads offensive security research at Strobes, focused on web and API exploitation and red team tradecraft.

Back to Blog

Penetration Testing

Penetration Testing Standards: PTES, OSSTMM, NIST, and OWASP

Shubham JhaSeptember 12, 20245 min read

Authors

Shubham Jha

TL;DR

✓The four most-used penetration testing standards are PTES, OSSTMM, NIST SP 800-115, and the OWASP testing guides.
✓PTES is the most complete end-to-end engagement framework, covering pre-engagement through reporting.
✓OSSTMM is a rigorous, metrics-driven methodology that produces a measurable RAV score.
✓NIST SP 800-115 is the go-to reference for US government and regulated environments.
✓OWASP WSTG, MASVS, and the API Top 10 are the application-layer standards testers map findings to.

Table of contents

Why do penetration testing standards matter?
What is PTES?
What is OSSTMM?
What is NIST SP 800-115?
OWASP is where the application-layer depth lives
How do testers stack these standards?

Why do penetration testing standards matter?

Penetration testing standards compared

Standard	Focus	Best for
PTES	Full engagement lifecycle	Structuring any end-to-end test
OSSTMM	Measurable, metrics-driven testing	Defensible, comparable scores (RAV)
NIST SP 800-115	Policy-level testing baseline	Government and regulated industries
OWASP WSTG / MASVS / API Top 10	Application-layer coverage	Web, mobile, and API technical depth

What is PTES?

What is OSSTMM?

Channel: Data Networks
  Visibility   12   Access  4   Trust  3   (porosity)
  Controls present: 9 of 14 expected
  RAV (actual security): 81.4 / 100
  ^ a number you can compare against last quarter, not a vibe

What is NIST SP 800-115?

OWASP is where the application-layer depth lives

The payoff is traceability. Every WSTG test has an ID, and a strong report cites it next to each finding so the reader knows exactly what was checked:

WSTG-ATHZ-02  Testing for Privilege Escalation       FAIL  (IDOR, High)
WSTG-ATHZ-01  Testing Directory Traversal / File Incl PASS
WSTG-SESS-03  Testing for Session Fixation           PASS
WSTG-INPV-05  Testing for SQL Injection              FAIL  (blind, Critical)

How to vet a vendor's methodology

Ask

✓Which standards do you follow and why those?
✓Can you show a redacted report with test IDs mapped?
✓How do you cover what your main standard leaves out?

Red flags

✓We use our own proprietary methodology, no detail
✓Findings rated by raw CVSS with no exploitability context
✓Report is raw scanner output with no WSTG / ATT&CK mapping

How do testers stack these standards?

Testers rarely use one standard in isolation; they stack them by what each does best:

PTES        -> engagement shape (scope to report)
OWASP WSTG  -> per-surface technical checklist (web/API/mobile)
MITRE ATT&CK -> map the attack chain to adversary techniques
NIST 800-115 -> the language auditors recognize

Strobes insight

Frequently asked questions

What is the best penetration testing standard?

What is the difference between PTES and OWASP?

Is NIST SP 800-115 required for compliance?

What does OSSTMM measure?

Which standard covers API testing?

How does MITRE ATT&CK relate to these standards?

Sources and references

Shubham Jha

Security Researcher, Strobes

Shubham Jha leads offensive security research at Strobes, focused on web and API exploitation and red team tradecraft.

Penetration Testing Standards: PTES, OSSTMM, NIST, and OWASP

Table of Contents

Authors

Share

Why do penetration testing standards matter?

What is PTES?

What is OSSTMM?

What is NIST SP 800-115?

OWASP is where the application-layer depth lives

How do testers stack these standards?

Frequently asked questions

Sources and references

Penetration Testing Standards: PTES, OSSTMM, NIST, and OWASP

Table of Contents

Authors

Share

Why do penetration testing standards matter?

What is PTES?

What is OSSTMM?

What is NIST SP 800-115?

OWASP is where the application-layer depth lives

How do testers stack these standards?

Frequently asked questions

Sources and references