Imagine a world where cutting-edge AI doesn't understand your language or culture. Sounds frustrating, right? That's the reality for many communities worldwide. Microsoft's new initiative, Project Gecko, is tackling this head-on, aiming to create AI that's truly inclusive.
Generative AI is revolutionizing our world, powering amazing tools and apps. But here's the catch: this technology often struggles with diverse languages and cultural nuances. Microsoft acknowledges that current AI systems can perform poorly and fail to reflect the social and cultural realities of every population.
Project Gecko is Microsoft's response to this challenge. It's all about developing AI that's equitable and inclusive. Ashley Llorens, Corporate Vice President and Managing Director of Microsoft Research Accelerator, put it perfectly: "Building AI systems from the ground up, shaped by the knowledge, languages, and modalities of the global majority, yields more innovative, useful solutions for a great number of people." This is a crucial step towards adapting and deploying AI widely in low-resource settings.
So, what exactly is Project Gecko?
It's a project focused on building AI systems that understand local languages, incorporate culturally relevant content, and engage users through various methods like text, voice, and video.
Microsoft has brought together researchers from Microsoft Research Africa (Nairobi), Microsoft Research India, the Microsoft Research Accelerator (United States), and Digital Green to work on this.
A key part of Project Gecko is a new AI system called MMCTAgent (Multimodal Critical Thinking Agent Framework). This system analyzes information from speech, images, and videos to provide relevant, context-aware responses. Think of it as an AI that truly understands what you're saying and showing it, leading to more helpful and accurate results. Microsoft says this is designed to improve experimental frontier models by supporting domain-specific tools that extend their capabilities.
But here's where it gets controversial... Many low-cost connected devices in the Global South don't have the computing power to run large language models (LLMs). That's why Project Gecko researchers are also focusing on Small Language Models (SLMs). This means the AI can still be powerful, even on less advanced devices.
The initial focus of Project Gecko is on small farms in India and Kenya. The team plans to expand into healthcare, education, and retail.
Microsoft has already used Project Gecko technologies to enhance FarmerChat, a speech-first AI assistant by Digital Green. The goal is to help farmers use speech or text to ask questions and receive actionable answers with step-by-step instructions in text, voice, and relevant video in their preferred language.
Microsoft's field studies in India and Kenya showed significant improvements in response quality, usability, and user trust compared to existing models. This is a huge step forward, but is it enough?
What do you think? Are you excited about the potential of inclusive AI? Do you see any challenges or limitations with this approach? Share your thoughts in the comments!