Microsoft Open-Sources Magma, a Multimodal AI Agent Base Model
At 3 a.m. Singapore time, Microsoft open-sourced Magma, a multimodal base model for AI agents, on its official website. Unlike traditional agents, Magma works across both the digital and physical worlds and can automatically process different types of data, such as images, video, and text. For example, Magma can be used to place e-commerce orders or check the weather automatically; it can also operate physical robots or provide assistance during a real game of chess. In addition, Magma has a built-in psychological prediction capability that enhances its understanding of spatiotemporal dynamics in future video frames, allowing it to accurately predict the intentions and future behavior of people or objects in videos.
Disclaimer: The content of this article solely reflects the author's opinion and does not represent the platform in any capacity. This article is not intended to serve as a reference for making investment decisions.








