January 2025
These features and Databricks platform improvements were released in January 2025.
Note
Releases are staged. Your Databricks account might not be updated until a week or more after the initial release date.
AI Gateway now supports provisioned throughput (Public Preview)
January 10, 2025
Mosaic AI Gateway now supports Foundation Model APIs provisioned throughput workloads on model serving endpoints.
You can now enable the following governance and monitoring features on your model serving endpoints that use provisioned throughput:
Permissions and rate limiting to control who has access and how much they can use.
Payload logging with inference tables to monitor and audit data sent to model APIs.
Usage tracking with system tables to monitor operational usage on endpoints and the associated costs.
Traffic routing to minimize production outages during and after deployment.
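As a rough illustration of how three of the features above (rate limiting, payload logging, and usage tracking) might be enabled together, the following Python sketch builds a configuration body for a serving endpoint. Everything in it is an assumption to check against the current API reference: the helper name build_ai_gateway_config is hypothetical, and the field names (usage_tracking_config, inference_table_config, rate_limits) along with the values "user" and "minute" may not match the actual request schema. The sketch only builds and prints the body. It makes no network call and does not cover traffic routing.

```python
import json


def build_ai_gateway_config(catalog, schema, table_prefix, calls_per_minute):
    """Build an illustrative AI Gateway configuration body.

    Field names are assumptions; confirm them against the API reference
    before sending this to a real workspace.
    """
    if calls_per_minute <= 0:
        raise ValueError("calls_per_minute must be positive")
    return {
        # Usage tracking: record endpoint usage in system tables.
        "usage_tracking_config": {"enabled": True},
        # Payload logging: write requests and responses to an inference table.
        "inference_table_config": {
            "enabled": True,
            "catalog_name": catalog,
            "schema_name": schema,
            "table_name_prefix": table_prefix,
        },
        # Rate limiting: cap calls per user per minute.
        "rate_limits": [
            {"calls": calls_per_minute, "key": "user", "renewal_period": "minute"}
        ],
    }


if __name__ == "__main__":
    # Placeholder catalog, schema, and prefix values.
    body = build_ai_gateway_config("main", "serving_logs", "pt_endpoint", 100)
    print(json.dumps(body, indent=2))
```

In practice, a body like this would be sent to the endpoint's AI Gateway configuration API by an authenticated client. Keeping payload construction in a separate function makes the settings easy to review and test before anything is applied.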