Oracle makes fully managed service for GenAI generally available
Cloud computing firm Oracle has made its fully managed service for generative artificial intelligence generally available. Called the Oracle Cloud Infrastructure (OCI) Generative AI service offers large language models from Cohere and Llama 2 from Meta to address business use cases via APIs. The platform also includes multilingual capabilities that support over 100 languages, an improved GPU cluster management experience, and flexible fine-tuning options. This service can be either used through Oracle Cloud or on-premise through OCI Dedicated Region.
“Instead of providing a tool kit that requires assembling, we are offering a powerful suite of pre-built generative AI services and features that work together to help customers solve business problems smarter and faster,” said Greg Pavlik, senior vice president, AI and Data Management, Oracle Cloud Infrastructure.
Oracle is also offering OCI Generative AI Agents service with a retrieval augmented generation (RAG) agent, in beta. To be sure, RAG is an approach to optimise the output of large language models (LLMs) by generating more targeted information on specific domain retrieved from data/documents, without modifying the underlying model. The OCI Generative AI Agents service combines the capabilities of LLMs and enterprise search built on OCI OpenSearch to provide contextualized results that are enhanced with enterprise data. While initial beta release supports OCI OpenSearch, upcoming releases will have wider range of data search and aggregation tools.
Lastly, Oracle is also expanding the capabilities of OCI Data Science to help customers build, train, deploy, and manage LLMs with open source libraries such as Hugging Face’s Transformers or PyTorch. The new OCI Data Science AI Quick Actions feature, which will be in beta next month, enables no-code access to a variety of open-source LLMs, including leading providers such as Meta or Mistral AI.
Oracle’s new offering rivals Amazon Bedrock platform which is also a fully managed service that offers foundational models accessible via API for users to build and scale their generative AI applications.