Canaria

Changelog

Product updates and improvements

new

New Website Launch

Launched the new Canaria website with dataset catalog, methodology documentation, data schema explorer, provider comparison, and solutions pages. Built on Next.js with a modern dark theme.

new

Automated Sample Generation Pipeline

Launched AI-powered sample generation. When you request a sample, Claude AI ("Brian") builds an optimized ClickHouse query based on your use case, executes it, reviews quality, and delivers a CSV with signed download link via email. Up to 3 quality review iterations per sample.

improved

Skills Taxonomy Expanded to 37,000+

Expanded the skills taxonomy to 37,000+ technical skills, 3,000+ certifications, and 400+ soft skills. Added NLP relevance filtering to reduce false positives (e.g., "Java" correctly excluded from Barista postings).

improved

Salary Prediction Model v2

Retrained the salary prediction model on 50M+ Glassdoor/Indeed observations. MAPE improved to under 15%. Coverage for 2023+ data now reaches 85-95%.

new

Canaria Job Intelligence Platform

Official launch of the Canaria Job Intelligence Platform with 900M+ unique deduplicated job postings, 82 enriched fields, and the Model Garden NLP pipeline. Coverage from 2022 to present with daily incremental updates.