Course Outline
Introduction
- Developing effective algorithms for pattern recognition, classification, and regression.
Setting Up the Development Environment
- Python libraries
- Online versus offline editors
Overview of Feature Engineering
- Input and output variables (features)
- Advantages and disadvantages of feature engineering
Types of Problems Found in Raw Data
- Unclean data, missing data, etc.
Pre-Processing Variables
- Addressing missing data
Handling Missing Values in the Data
Working with Categorical Variables
Converting Labels into Numbers
Managing Labels in Categorical Variables
Transforming Variables to Enhance Predictive Power
- Numerical, categorical, date, etc.
Cleaning a Dataset
Machine Learning Modeling
Handling Outliers in Data
- Numerical variables, categorical variables, etc.
Summary and Conclusion
Requirements
- Experience with Python programming.
- Familiarity with Numpy, Pandas, and scikit-learn.
- Understanding of Machine Learning algorithms.
Target Audience
- Developers
- Data Scientists
- Data Analysts
Testimonials (2)
the ML ecosystem not only MLFlow but Optuna, hyperops, docker , docker-compose
Guillaume GAUTIER - OLEA MEDICAL
Course - MLflow
I enjoyed participating in the Kubeflow training, which was held remotely. This training allowed me to consolidate my knowledge for AWS services, K8s, all the devOps tools around Kubeflow which are the necessary bases to properly tackle the subject. I wanted to thank Malawski Marcin for his patience and professionalism for training and advice on best practices. Malawski approaches the subject from different angles, different deployment tools Ansible, EKS kubectl, Terraform. Now I am definitely convinced that I am going into the right field of application.