Sakura Internet launched “Sakura AI Engine,” an inference API platform for generative AI. Accessible through the “Sakura Cloud” control panel, it allows users to incorporate foundational models, including large-scale language models (LLMs), into applications via API. Sakura AI Engine is based on the “High Firepower” cloud service for generative AI and provides multiple domestic and international foundational models and search augmentation generation (RAG) functions via API. This allows companies to select the optimal foundational model based on their objectives and performance requirements and incorporate generative AI-based applications into their own services.
The cloud-based execution environment eliminates the need for infrastructure construction, as no computing infrastructure or network configuration is required. Various AI functions are provided as REST APIs, facilitating application integration and prototype development. The RAG function, which connects with vector databases, can be accessed via API, enabling chatbots and FAQs that utilize in-house data. NVIDIA GPU resources are used for inference processing. Since foundational models can be selected on an infrastructure comprised of domestic data centers operated by Sakura Internet, the system can be used entirely within Japan. Two types of plans are available: the “Free Platform Model Plan” and the “Pay-As-You-Go Plan.” If the free usage limit common to both plans is exceeded, the Free Platform Model Plan applies rate control to API requests, while the Pay-As-You-Go Plan charges for excess usage. There is a limit to the number of applications for the Free Platform Model Plan, and once the limit is reached, new applications will no longer be accepted.
こちらもお読みください: Salesforce Japan debuts AI sales Coaching with Agentforce
In conjunction with the launch of Sakura’s AI Engine service, Sakura Internet has changed the name of its fully managed execution platform for generative AI, “Sakura’s Generative AI Platform,” to “Sakura’s AI,” a business platform for generative AI. Going forward, under Sakura‘s AI, the company plans to gradually expand various services that utilize generative AI, supporting corporate operational efficiency and business growth.
ソース ヤフー