Meeting Intelligence Skill Evaluations

Evaluation scenarios for testing the Meeting Intelligence skill across different Claude models.

Purpose

These evaluations ensure the Meeting Intelligence skill:

  • Gathers context from Notion workspace
  • Enriches with Claude research appropriately
  • Creates both internal pre-reads and external agendas
  • Distinguishes between Notion facts and Claude insights
  • Works consistently across Haiku, Sonnet, and Opus

Evaluation Files

decision-meeting-prep.json

Tests preparation for a decision-making meeting.

Scenario: Prep for database migration decision meeting
Key Behaviors:

  • Searches Notion for migration context (specs, discussions, options)
  • Fetches 2-3 relevant pages
  • Enriches with Claude research (decision frameworks, migration best practices)
  • Creates comprehensive internal pre-read with recommendation
  • Creates clean, professional external agenda
  • Clearly distinguishes Notion facts from Claude insights
  • Cross-links both documents
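
The behaviors above could be encoded as machine-checkable criteria in the evaluation file. As a sketch only (the actual evaluation schema may differ; every field name here is hypothetical), decision-meeting-prep.json might look like:

```json
{
  "name": "decision-meeting-prep",
  "query": "Help me prep for tomorrow's database migration decision meeting",
  "success_criteria": [
    "Searches Notion for migration context before doing Claude research",
    "Fetches 2-3 relevant pages",
    "Creates TWO documents (internal pre-read + external agenda)",
    "Clearly distinguishes Notion facts from Claude insights",
    "Cross-links both documents"
  ],
  "models": ["haiku", "sonnet", "opus"]
}
```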

status-meeting-prep.json

Tests preparation for a status update or review meeting.

Scenario: Prep for project status review
Key Behaviors:

  • Gathers project metrics and progress from Notion
  • Fetches relevant pages (roadmap, tasks, milestones)
  • Adds Claude context (industry benchmarks, best practices)
  • Creates internal pre-read with honest assessment
  • Creates external agenda with structured flow
  • Includes source citations using mention-page tags
  • Time-boxes agenda items

Running Evaluations

  1. Enable the meeting-intelligence skill
  2. Submit the query from the evaluation file
  3. Verify the skill searches Notion first (not Claude research)
  4. Check that TWO documents are created (internal + external)
  5. Verify Claude enrichment adds value without replacing Notion content
  6. Test with Haiku, Sonnet, and Opus
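
Steps 3–5 can be partially automated. The sketch below assumes a hypothetical output shape (a list of {"title", "body"} dicts for the created documents) and mirrors the two-document and citation checks; it is illustrative, not part of the actual harness:

```python
import re

def check_meeting_prep_output(documents):
    """Lightweight checks mirroring steps 4-5: exactly two documents,
    one internal pre-read and one external agenda, and Notion sources
    cited via mention-page tags. `documents` is a list of
    {"title": str, "body": str} dicts (a hypothetical output shape)."""
    titles = " ".join(d["title"].lower() for d in documents)
    bodies = " ".join(d["body"] for d in documents)
    return {
        "two_documents": len(documents) == 2,
        "has_pre_read": "pre-read" in titles,
        "has_agenda": "agenda" in titles,
        "cites_notion": bool(re.search(r"<mention-page[^>]*>", bodies)),
    }

docs = [
    {"title": "INTERNAL Pre-Read: DB Migration Decision",
     "body": '<mention-page url="notion://abc">Migration spec</mention-page>'},
    {"title": "Agenda: DB Migration Decision Meeting",
     "body": "1. Objective (5 min)"},
]
checks = check_meeting_prep_output(docs)
```

Enrichment quality (step 5) still needs human or model-graded review; only the structural checks are mechanical.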

Expected Skill Behaviors

Meeting Intelligence evaluations should verify:

Notion Context Gathering

  • Searches workspace for relevant context first
  • Fetches specific pages (not generic)
  • Extracts key information from Notion content
  • Cites sources using mention-page tags

Claude Research Integration

  • Adds industry context, frameworks, or best practices
  • Enrichment is relevant and valuable (not filler)
  • Clearly distinguishes Notion facts from Claude insights
  • Research complements (doesn't replace) Notion content

Two-Document Creation

  • Internal Pre-Read: Comprehensive, includes strategy, recommendations, detailed pros/cons
  • External Agenda: Professional, focused on meeting flow, no internal strategy
  • Both documents are clearly labeled
  • Documents are cross-linked

Document Quality

  • Pre-read follows structure: Overview → Background → Current Status → Context & Insights → Discussion Points
  • Agenda follows structure: Details → Objective → Agenda Items (with times) → Decisions → Actions → Resources
  • Titles include date or meeting context
  • Content is actionable and meeting-ready
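
A spot-check for the pre-read structure above can be as simple as verifying that the expected headings appear in order. This is a sketch: the heading strings come from the structure listed above, and the document is assumed to be available as plain text:

```python
# Expected pre-read section order, per the structure above.
PRE_READ_SECTIONS = ["Overview", "Background", "Current Status",
                     "Context & Insights", "Discussion Points"]

def sections_in_order(body, expected):
    """Return True if every expected heading occurs in body,
    in the listed order (case-sensitive substring search)."""
    pos = 0
    for heading in expected:
        idx = body.find(heading, pos)
        if idx == -1:
            return False
        pos = idx + len(heading)
    return True

pre_read = ("Overview\n...\nBackground\n...\nCurrent Status\n...\n"
            "Context & Insights\n...\nDiscussion Points\n...")
ok = sections_in_order(pre_read, PRE_READ_SECTIONS)
```

The same helper works for the agenda structure by swapping in its section list.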

Creating New Evaluations

When adding Meeting Intelligence evaluations:

  1. Test different meeting types - Decision, status, brainstorm, 1:1, sprint planning, retrospective
  2. Vary complexity - Simple updates vs. complex strategic decisions
  3. Test with/without Notion content - Rich workspace vs. minimal existing pages
  4. Verify enrichment value - Is Claude research genuinely helpful?
  5. Check internal/external distinction - Is sensitive info kept in pre-read only?

Example Success Criteria

Good (specific, testable):

  • "Creates TWO documents (internal pre-read + external agenda)"
  • "Internal pre-read marked 'INTERNAL ONLY' or 'For team only'"
  • "Cites at least 2-3 Notion pages using mention-page tags"
  • "Agenda includes time allocations for each section"
  • "Claude enrichment includes decision frameworks or best practices"

Bad (vague, untestable):

  • "Creates meeting materials"
  • "Gathers context effectively"
  • "Prepares well"