--- name: seo-international description: | International SEO audit and hreflang validation for multi-language and multi-region websites. Triggers: hreflang, international SEO, multi-language, multi-region, content parity, x-default, ccTLD, 다국어 SEO. --- # International SEO Audit ## Purpose Audit international SEO implementation: hreflang tags, URL structure patterns, content parity across language versions, redirect logic, and Korean expansion strategies. Identify issues preventing proper multi-language indexing. ## Core Capabilities 1. **Hreflang Validation** - Bidirectional links, self-reference, x-default, language code validation 2. **URL Structure Analysis** - ccTLD vs subdomain vs subdirectory pattern detection 3. **Content Parity Audit** - Page count comparison, key page availability across languages 4. **Redirect Logic Audit** - IP-based, Accept-Language redirects, forced redirect detection 5. **Korean Expansion** - Priority markets (ja, zh, en), CJK URL issues, regional search engines ## MCP Tool Usage ### SEO Data ``` our-seo-agent CLI: Primary country metrics source (future); use --input for pre-fetched JSON WebSearch / WebFetch: Supplementary international data ``` ### Notion for Report Storage ``` mcp__notion__notion-create-pages: Save audit report to SEO Audit Log database ``` ### WebSearch for Best Practices ``` WebSearch: Research hreflang implementation guides and regional search engine requirements ``` ## Workflow ### 1. Hreflang Validation 1. Fetch target URL and extract hreflang tags (HTML head, HTTP headers) 2. If sitemap provided, also extract xhtml:link hreflang from XML sitemap 3. Validate language codes (ISO 639-1) and region codes (ISO 3166-1) 4. Check bidirectional links (if A references B, B must reference A) 5. Verify self-referencing tags on each page 6. Check x-default tag presence and validity 7. Detect conflicting hreflang for same language-region 8. Report all errors with severity levels ### 2. URL Structure Analysis 1. Crawl known language versions of the site 2. Classify pattern: ccTLD (example.kr), subdomain (ko.example.com), subdirectory (example.com/ko/) 3. Check consistency across all language versions 4. Provide recommendation based on business context ### 3. Content Parity Audit 1. Discover all language versions from hreflang tags 2. Count pages per language version 3. Check availability of key pages (home, about, contact, products/services) 4. Compare content freshness (last modified dates) across versions 5. Flag significant gaps in content availability ### 4. Redirect Logic Audit 1. Test URL with different Accept-Language headers (ko, en, ja, zh) 2. Check if redirects are forced (no way to override) vs suggested (banner/popup) 3. Flag forced geo/language redirects as anti-pattern 4. Recommend proper implementation (suggest, do not force) ### 5. Korean Expansion Analysis (Optional) 1. Analyze current traffic by country via our-seo-agent CLI or pre-fetched data 2. Recommend priority target markets for Korean businesses 3. Check CJK-specific URL encoding issues 4. Advise on regional search engines (Naver, Baidu, Yahoo Japan) ## Output Format ```markdown ## 다국어 SEO 감사: [domain] ### Hreflang 검증 - 검사 페이지 수: [count] - 오류: [count] (심각 [count], 경고 [count]) - 양방향 링크 누락: [list] - 자기참조 누락: [list] - x-default: [있음/없음] ### URL 구조 - 패턴: [ccTLD/subdomain/subdirectory] - 일관성: [양호/비일관] - 권장사항: [recommendation] ### 콘텐츠 동등성 | 언어 | 페이지 수 | 핵심 페이지 | 최신성 점수 | |------|----------|------------|-----------| | ko | 150 | 5/5 | 90 | | en | 120 | 4/5 | 75 | ### 리다이렉트 로직 - IP 기반 리다이렉트: [있음/없음] - 언어 기반 리다이렉트: [있음/없음] - 강제 리다이렉트: [있음/없음] (없어야 정상) ### 종합 점수: [score]/100 ### 권장 조치사항 1. [Priority fixes in Korean] ``` ## Notion Output (Required) All audit reports MUST be saved to OurDigital SEO Audit Log: - **Database ID**: `2c8581e5-8a1e-8035-880b-e38cefc2f3ef` - **Properties**: Issue (title), Site (url), Category (International SEO), Priority, Found Date, Audit ID - **Language**: Korean with English technical terms - **Audit ID Format**: INTL-YYYYMMDD-NNN ## Limitations - Cannot detect server-side IP-based redirects without proxy testing - Content language detection requires sufficient text content - Large sites (10,000+ pages) require sampling approach - Sitemap-based hreflang requires XML sitemap access