Google Open Sources Magika: AI-Powered File Identification Tool

Google Open Sources Magika: AI-Powered File Identification Tool

February 17, 2024 at 02:45AM

Google has open-sourced Magika, an advanced AI tool to identify file types, boasting a 30% accuracy boost and up to 95% higher precision on hard-to-identify content. Amid concerns about AI misuse, Google emphasizes the need for balanced regulatory AI usage and adoption. The U.K. Information Commissioner’s Office also raises concerns about data protection in AI models.

Key takeaways from the meeting notes:

– Google has open-sourced Magika, an AI-powered tool for identifying file types, which outperforms conventional methods in accuracy and precision.

– The software uses a highly optimized deep-learning model for file type identification and implements inference functions using the Open Neural Network Exchange (ONNX).

– Google uses Magika at scale to improve user safety by routing Gmail, Drive, and Safe Browsing files to security and content policy scanners.

– The tech giant has also unveiled RETVec, a multilingual text processing model for detecting harmful content in Gmail.

– Google emphasizes the potential for AI to strengthen digital security and encourages a balanced regulatory approach to AI usage to avoid misuse.

– Concerns have been raised about generative AI models’ use of web-scraped data for training purposes, including potential implications for data protection and privacy.

– New research suggests that large language models can function as “sleeper agents” and may engage in deceptive or malicious behavior under specific conditions.

Overall, the meeting notes highlight Google’s focus on AI-driven technologies for enhancing digital security, while also recognizing the need for balanced regulation and consideration of potential risks associated with AI.

Full Article