Changes for page h. Test SFT1: System baseline test

Last modified by Rosa Van Tuijn on 2026/04/13 12:47

From 1.15 to 1.14 From 4.1 to 3.3

From version 3.3

edited by Rosa Van Tuijn
on 2026/03/25 14:36

Change comment: There is no comment for this version

To version 1.15

edited by Tjalling Haije
on 2026/03/23 15:25

Change comment: There is no comment for this version

Raw
Rendered

Summary

Page properties (3 modified, 0 added, 0 removed)

Details

Page properties

Title

@@ -1,1 +1,1 @@
--h. Test SFT1: System baseline test
++h. Test SFT1

Author

@@ -1,1 +1,1 @@
--XWiki.RosaVanTuijn
++XWiki.TjallingHaije

Content

@@ -1,5 +1,5 @@
  **Experiment title**
--System baseline test
++[Short, descriptive title]
  **2. Objective / Research Question**
@@ -10,6 +10,7 @@
 . FR Health
 . FR and victim safety
++
  **3. Hypotheses / Expectations (optional)**
  See the claims from [[UC04.3: Detailed indoor exploration with ANYMAL (USAR)>>2\. Specification.b\. Use Cases.UC04\.0\: Detailed indoor exploration.UC04\.3\: Detailed indoor exploration with ANYMAL (USAR).WebHome]], being:
@@ -21,6 +21,7 @@
  * CL06: improved FR health
  * CL07: degraded mission efficiency
++
  **4. Scenario / Context**
  ASR3: Detailed indoor exploration
@@ -32,6 +32,7 @@
  ** **ANYmal **is mission-ready (battery, controls, and payload checked).
  ** **C3I is online**, role-based views are configurated (Operator/ Analyst/ Team leader/ Safety officer); **5G pods** are installed for connection.
++
  **5. Participants**
  * FR team ROBOT:
@@ -104,107 +104,12 @@
  * FR Team ROBOT does Building B
  * FR team HUMAN does Building A
++
  **9. Measurements**
--See the measurements from [[UC04.3: Detailed indoor exploration with ANYMAL (USAR)>>2\. Specification.b\. Use Cases.UC04\.0\: Detailed indoor exploration.UC04\.3\: Detailed indoor exploration with ANYMAL (USAR).WebHome]]. Compare outcomes between the two conditions teams if applicable.
++See the measurements from [[UC04.3: Detailed indoor exploration with ANYMAL (USAR)>>2\. Specification.b\. Use Cases.UC04\.0\: Detailed indoor exploration.UC04\.3\: Detailed indoor exploration with ANYMAL (USAR).WebHome]]
--|**Claim: Safety [CL01]**|**Measurment(s)**|**How to test **
--|Safer decision before entry|Safe path chosen|(((
--* Define ground-truth safe path(s) beforehand so that they can be compared to chosen entry path during the trial
--* Score paths beforehand: safe/ suboptimal/ unsafe
--)))
--|Better decision overall|"How good were you decisions looking back?"|(((
--* Hotwash reflection question (likert + open)
--* Observations of errors (e.g. near-incidents, unsafe choices)
--)))
--|Fewer dangerous situations|Near-incident reports|(((
--* Define hazards in buildings beforehand and check (with observations of during hotwash) if these hazards are avoided or not
--)))
--|**Claim: Situation Awareness [CL02]**|**Measurment(s)**|**How to test **
--|Before entry |(((
--Correctness of reported hazards, victims, layout
--)))|(((
--* Before entry let squad leader and entry team (seperatly) sketch/map or describe inside of building (harzards, victims, layout) > compare these with the ground-truth
--
--(((
--
--)))
--)))
--|During entry|(((
--FR path logs
--)))|(((
--* Compare plannend vs. executed path (how many or how big were the deviations? and were they acceptable?) and identify detours caused by misjudgment or new information
--
--(((
--
--)))
--)))
--|After mission|(((
--Correctness of reported hazards, victims, layout
--)))|(((
--* Repeat sketching/mapping or descriptive task and compare the outcomes (improvement/ deterioration vs pre-entry)
--)))
--
--|**Claim: Mission effectiveness [CL03] [CL04] [CL05]**|**Measurment(s)**|**How to test **
--|(((
--Acceptable workload
--)))|(((
--
--
--NASA-TLX
--)))|(((
--* Shortened post-trial NASA-TLX questionnaire and analyze score (discuss if these with experts to check if these are "acceptable")
--
--(((
--
--)))
--)))
--|(((
--Appropriate trust
--)))|(((
--
--
--Trust survey / interviews
--)))|(((
--*
--)))
--|(((
--Decision confidence
--)))|(((
--
--
--Retrospective decision quality rating
--)))|
--
--|**Claim: Health [CL06]**|**Measurment(s)**|**How to test **
--|(((
--Health remains acceptable
--)))|(((
--
--
--Health indicators
--)))|(((
--* Define acceptable ranges (e.g. HR, temperature, etc.) beforehand and log these ranges
--)))
--|(((
--Issues handled in time
--)))|(((
--
--
--Health issues tackled in acceptable time
--)))|(((
--* Timestamp when acceptable ranges are breached till moment detection and intervention moment (compare this with predefined response thresholds)
--)))
--
--|**Claim: Mission Efficiency [CL07]**|**Measurment(s)**|**How to test **
--|Faster mission execution|Total mission completion time|(((
--* Timestamp of start (after briefing) till all located victims have been extracted
--)))
--|Faster inside performance|First responder time inside|(((
--* Timestamp of entry building till exist building (last located victim is exstracted)
--)))
--
  **10. Procedure (Step-by-Step)**
 . __Team preparation__:
@@ -226,33 +226,32 @@
 . Time for tackling any health alerts
 . Track FR path
 . __Hotwash__:
--11. Questions (specified per role, couple questions per topic) on:
++11. Questions on:
 . Near-incident report
 . Path inside
 . FR location and status of hazards, victims, and layout
--111. Mission effectiveness: e.g. self reported effectiveness of decisions
++111. Mission effectiveness
 . Self-reported workload
 . Trust survey
--11. Group discussion with feedback
  **~11. Planning:**
  * Team preparation: Day before. 1 hour.
  * Trial 1:
--** Setup scenario: Test day 1. 09:00 - 09:30
++** Setup usecase: Test day 1. 09:00 - 09:30
  ** Setup tech and perform checks: Test day 1. 09:00 - 09:30
  ** Briefing of each team individually (outside building): Test day 1: 09:00 - 09:30
  ** Mission execution: Test day 1. 09:30 - 10:30
  ** Hot wash: Test day 1. 10:30-11:00
--** Scenario reset: test day 1. 10:30-11:00
++** Usecase reset: test day 1. 10:30-11:00
  * Teams switch location: Test day 1. 11:00-11:15
  * Trial 2:
--** (Setup scenario: Test day 1. 12:30 - 13:00)
++** Setup usecase: Test day 1. 12:30 - 13:00
  ** Setup tech and perform checks: Test day 1. 12:30 - 13:00
  ** Briefing of each team individually (outside building): Test day 1: 12:30 - 13:00
  ** Mission execution: Test day 1. 13:00 - 14:00
  ** Hot wash: Test day 1. 14:00-14:30
--* Cleanup area and store tech. 14:00-14:30
++** Cleanup area and store tech. 14:00-14:30
@@ -271,6 +271,7 @@
  ** ~~8x Actors or dummies for victims.
  * 8x Clipboards with papers and pens for hotwash and questionnaires
++
  **13. Dependencies**
  * Availability of materials
@@ -281,6 +281,7 @@
  ** People willing to play victim
  ** Experiment leaders
++
  **14. Success Criteria**
  When mission(s) completed by both teams, and we can compare their performance using the metrics.

Changes for page h. Test SFT1: System baseline test

Summary

Details

Welcome

Navigation

Recently Modified