
E919 - Even GenAI uses Wikipedia as a source
Published: February 20, 2026
Duration: 26:54
Ryan is joined by Philippe Saade, the AI project lead at Wikimedia Deutschland, to dive into the Wikidata Embedding Project and how their team vectorized 30 million of Wikidata’s 119 million entries for semantic search. They discuss how the project eased the load that scraping was putting on their sites, what Wikimedia.DE is doing to maintain the integrity of its data, and why user feedback still matters as the team works to bring Wikipedia’s vast knowledge to people building open-source AI projects.
Episode notes:
Wikimedia.DE announced the Wikidata Embedding Project with MCP suppor...