A machine learning system is a computer system that is responsible for managing the data and the programs that train and operate the machine learning models that power an AI-enabled application or service.
Machine learning systems (ML systems) can be categorized into four different types:
Real-time, interactive applications differ from the other machine learning systems as they often use models as external network callable services that are hosted on standalone model serving infrastructure. Batch, stream processing, and embedded/edge machine learning systems typically embed the model as part of the system and invoke the model via a function or inter-process call.
The following are examples of the four different types of machine learning systems:
Tiktok builds its personalized recommendations engine using ML and a real-time feature store that provides historical user information and context to better personalize recommendations.
Machine learningsystems are both trained and operated using cleaned and processed data (called features), created by a program called a feature pipeline. The feature pipeline writes its output feature data to a feature store that feeds data to both the training pipeline (that trains the model) and the inference pipeline. The inference pipeline makes predictions on new data that comes from the feature pipeline. Real-time, interactive ML systems also take new data as input from the user. Feature pipelines and inference pipelines are operational services - part of the operational ML system. In contrast, a ML system also has an offline component - model training. The training of models is typically not an operational part of a ML system. Training pipelines can be run on separate systems using separate resources (e.g., GPUs). Models are sometimes retrained on a schedule (e.g., once day/week/etc), but are often retrained when a new improved model becomes available, e.g., because new training data is available or the existing model’s performance has degraded and the model needed to be retrained on more recent data.