Novita AI Becomes Official Inference Partner for Hugging Face
Novita AI has joined Hugging Face as an official inference partner, announced in a press release. The partnership enables over five million developers on Hugging Face to deploy AI models instantly, without needing to configure infrastructure or manage containers.
The collaboration introduces a new "Deploy on Novita" option on Hugging Face, letting developers turn open-source models into production-ready APIs in seconds. Novita AI also served as a day-zero launch partner for inference on Google's Gemma 4 model.
According to the release, developers using Novita’s inference services can achieve time-to-first-token speeds as low as 50 milliseconds and reduce costs by up to 50% compared to typical endpoints. The partnership aims to simplify deployment for open-source models while maintaining high performance and scalability.
We hope you enjoyed this article.
Consider subscribing to one of our newsletters like Enterprise AI Brief or Daily AI Brief.
Also, consider following us on social media:
More from: Enterprise
Subscribe to Enterprise AI Brief
Weekly report on AI business applications, enterprise software releases, automation tools, and industry implementations.
Market report
AI’s Time-to-Market Quagmire: Why Enterprises Struggle to Scale AI Innovation
The 2025 AI Governance Benchmark Report by ModelOp provides insights from 100 senior AI and data leaders across various industries, highlighting the challenges enterprises face in scaling AI initiatives. The report emphasizes the importance of AI governance and automation in overcoming fragmented systems and inconsistent practices, showcasing how early adoption correlates with faster deployment and stronger ROI.
Read more