Prosus NV (via Public) / Towards MLOps: technical capabiliti

Prosus NV (via Public) / Towards MLOps: technical capabilities of a machine learning platform


Towards MLOps: technical capabilities of a machine learning platform
The choice of the technology and tool that delivers this functionality is crucial, as it's a dependency of the ML pipeline. Traditionally, hadoop-based data lakes would have a workflow manager like Oozie or Azkaban to perform such activities. Projects like Airflow and Luigi dominated that space by providing an independent tool outside of the hadoop ecosystem, as companies moved their data and workloads to cloud-based data lakes. Currently, Airflow is the leading workflow management system for data processing.
2.4 Data labeling
Labeled data is frequently required to develop machine learning models. When this labeled data is not available, a data labeling activity may need to happen as part of an ML project, to create an initial training dataset. Within Prosus group, our classifieds business OLX has identified the importance of the labeling activity also at the end of the ML pipeline. For example, the OLX moderation team labels accounts flagged as fraudsters. They essentially validate the predicted labels of the fraud detection models, setting up the ground truth dataset for the next iterations of the CT.

Related Keywords

Turkey , Brazil , Turk , Databricks Mlflow , Lyft Amundsen , Ibm , Amazon Mechanical Turk , Data Catalog , Continuous Training , Hyper Parameter Optimization , Neural Architecture Search , Weights Biases , வான்கோழி , பிரேசில் , துருக்கி , லிஃப்ட் ஏமந்‌ட்ஸெந் , ஐபீயெம் , அமேசான் இயந்திர துருக்கி , தகவல்கள் அட்டவணை , தொடர்ச்சியான பயிற்சி , நரம்பியல் கட்டிடக்கலை தேடல் ,

© 2025 Vimarsana