diff --git a/content/en/docs/Autonomous UAT Agent/_index.md b/content/en/docs/Autonomous UAT Agent/_index.md index 03b6689..d330c57 100644 --- a/content/en/docs/Autonomous UAT Agent/_index.md +++ b/content/en/docs/Autonomous UAT Agent/_index.md @@ -8,3 +8,5 @@ description: > This section contains the core documentation for D66, focusing on how the Autonomous UAT Agent works and how to run it. +“Autonomous UAT Agent” is the current working title of the application; **UAT** stands for **User Acceptance Testing** (i.e., end-to-end testing from a user’s perspective). + diff --git a/content/en/docs/Autonomous UAT Agent/results/_index.md b/content/en/docs/Autonomous UAT Agent/results/_index.md index 2c5e733..d741beb 100644 --- a/content/en/docs/Autonomous UAT Agent/results/_index.md +++ b/content/en/docs/Autonomous UAT Agent/results/_index.md @@ -11,4 +11,3 @@ This section contains the outputs that support D66 claims: findings summaries an - [PoC Validation](./poc-validation.md) - [Golden Run (Telekom Header Navigation)](./golden-run-telekom-header-nav/) -- [Logs & Artifacts](./logs-and-artifacts.md) diff --git a/content/en/docs/Autonomous UAT Agent/results/logs-and-artifacts.md b/content/en/docs/Autonomous UAT Agent/results/logs-and-artifacts.md deleted file mode 100644 index d2c740e..0000000 --- a/content/en/docs/Autonomous UAT Agent/results/logs-and-artifacts.md +++ /dev/null @@ -1,34 +0,0 @@ ---- -title: "Logs & Artifacts" -linkTitle: "Logs & Artifacts" -weight: 2 -description: > - Where to find logs, screenshots, and reports relevant to D66 ---- - -## Repo locations - -- Local calibration and run logs: `logs/` -- Script outputs (varies by run): - - `Backend/IPCEI-UX-Agent-S3/staging_scripts/uxqa.db` - - `Backend/IPCEI-UX-Agent-S3/staging_scripts/Screenshots/` - - `Backend/IPCEI-UX-Agent-S3/staging_scripts/agent_output/` - -- Golden run evidence pack (recommended publishing location in docs): - - `docs/D66/results/golden-run-telekom-header-nav/` - -## What to capture for D66 - -- A representative run per capability: - - functional correctness checks - - visual quality audits - - task-based UX smoke tests -- For each run, capture: - - target URL - - timestamp - - key screenshots/overlays - - issue summaries (structured) - -## Notes - -If needed, we can add a consistent run naming convention and a small “how to export a D66 evidence pack” procedure. diff --git a/content/en/docs/Autonomous UAT Agent/results/poc-validation/_index.md b/content/en/docs/Autonomous UAT Agent/results/poc-validation/_index.md index 1f94994..78a64b8 100644 --- a/content/en/docs/Autonomous UAT Agent/results/poc-validation/_index.md +++ b/content/en/docs/Autonomous UAT Agent/results/poc-validation/_index.md @@ -37,7 +37,7 @@ The PoC covered three concrete use cases: - Golden run (Telekom header navigation): [Golden Run: Telekom Header Navigation](../golden-run-telekom-header-nav/) - **Artifact locations and guidance:** - - See: [Logs & Artifacts](../logs-and-artifacts/) + - See: Golden Run evidence pack and run outputs in `results/` - **Model configuration context (current vs legacy):** - See: [Model Stack](../../model-stack/)