ML Concepts – What is Feature Scaling? – Journal of Intelligent Infrastructure



Dr. Pranay Jha

May 30, 2022


Feature Scaling

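Feature scaling is a technique that brings the features of a dataset onto a comparable scale, typically by using statistics of each feature such as its mean and standard deviation. If we apply feature scaling before splitting the dataset, the mean and standard deviation are computed from all the values, including those in the test set. This causes information leakage, because the test set is supposed to stay unseen during training. Split the dataset first, compute the scaling statistics on the training set only, and then apply them to both sets. We do not need to apply feature scaling to every machine learning model, only to some of them. For example, a plain linear regression model does not require it, because its coefficients adapt to the scale of each feature.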
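The split-then-scale order can be sketched in a few lines of Python. This is only a minimal illustration, not the author's code: the feature values, the 6/2 split, and the helper names `fit_standardizer` and `apply_standardizer` are all hypothetical, and the population standard deviation is an assumption.

```python
import statistics

def fit_standardizer(values):
    """Compute the scaling statistics from the given values only."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)  # population standard deviation (assumption)
    return mean, std

def apply_standardizer(values, mean, std):
    """Scale values with statistics that were computed elsewhere."""
    return [(v - mean) / std for v in values]

# Hypothetical feature values, for illustration only.
data = [10.0, 20.0, 30.0, 40.0, 50.0, 60.0, 70.0, 80.0]

# 1. Split first, so the test values never influence the statistics.
train, test = data[:6], data[6:]

# 2. Fit on the training set only.
train_mean, train_std = fit_standardizer(train)

# 3. Apply the same training statistics to both sets.
train_scaled = apply_standardizer(train, train_mean, train_std)
test_scaled = apply_standardizer(test, train_mean, train_std)
```

Calling `fit_standardizer(data)` on the full dataset instead would let the test values shift the mean and standard deviation, which is the leakage described above.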

Simple meaning: Suppose you have a dataset with income details or network load details. Sometimes the network load reaches 10 Gbps, sometimes it is 1 Gbps, and sometimes it is only 100 Mbps. If you use this data as it is, or plot it on a graph, you need a very wide scale: the axis has to go up to 10,000 Mbps. To mitigate this, you scale the data into a range of 0 to 100%. If 10 Gbps is used, that is 100%; if 100 Mbps is used, that is 1%. The data is now simple to use. This is like a CPU utilization capacity report: assuming a host with 10,000 MHz of total capacity, 340 MHz used becomes 3.4% utilization and 2,000 MHz used becomes 20% utilization.
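The percentage idea can be tried out with a small Python sketch. The 10 Gbps link capacity, the readings, and the function name `to_percent_of_capacity` are assumptions made for illustration.

```python
def to_percent_of_capacity(value, capacity):
    """Express a raw reading as a percentage of total capacity."""
    return value * 100.0 / capacity

# Assumed link capacity: 10 Gbps, written in Mbps so all readings share one unit.
LINK_CAPACITY_MBPS = 10_000

# Hypothetical load readings in Mbps: 10 Gbps, 1 Gbps, and 100 Mbps.
readings_mbps = [10_000, 1_000, 100]

utilization = [to_percent_of_capacity(r, LINK_CAPACITY_MBPS) for r in readings_mbps]

for reading, percent in zip(readings_mbps, utilization):
    print(f"{reading} Mbps -> {percent:.1f}% of capacity")
```

The important step is converting every reading to the same unit before dividing; mixing Gbps and Mbps values would produce misleading percentages.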

Feature Scaling Techniques:

Standardization
Normalization

Standardization: This consists of subtracting the mean of the feature from each value of the feature and dividing the result by the standard deviation, which is the square root of the variance. Standardization works well in most cases, so it is the technique we use most of the time. It is calculated as:

X_new = (X - mean)/Std
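As a rough sketch of this formula in Python, using only the standard library: the income values and the function name `standardize` are hypothetical, and the population standard deviation is an assumption, since the formula does not say which variant to use.

```python
import statistics

def standardize(values):
    """Apply X_new = (X - mean) / std to every value of one feature."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)  # population standard deviation (assumption)
    if std == 0:
        raise ValueError("cannot standardize a constant feature")
    return [(x - mean) / std for x in values]

# Hypothetical income values, for illustration only.
incomes = [30_000.0, 45_000.0, 60_000.0, 75_000.0, 90_000.0]
scaled = standardize(incomes)
```

After standardization the feature has a mean of 0 and a standard deviation of 1, so a value equal to the original mean (60,000 here) maps to 0.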

Normalization: This consists of subtracting the minimum value of the feature from each value of the feature and dividing the result by the difference between the maximum and minimum values of the feature, so the scaled values fall between 0 and 1. Normalization is recommended when the feature follows a normal distribution, so we use it in more specific situations. It is calculated as:

X_new = (X - X_min)/(X_max - X_min)
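A minimal Python sketch of this formula follows. The load values and the function name `min_max_normalize` are hypothetical and chosen only to illustrate the calculation.

```python
def min_max_normalize(values):
    """Apply X_new = (X - X_min) / (X_max - X_min) to every value of one feature."""
    x_min, x_max = min(values), max(values)
    if x_max == x_min:
        raise ValueError("cannot normalize a constant feature")
    return [(x - x_min) / (x_max - x_min) for x in values]

# Hypothetical network load readings in Mbps, for illustration only.
loads_mbps = [100.0, 1_000.0, 10_000.0]
normalized = min_max_normalize(loads_mbps)
```

The smallest value always maps to 0 and the largest to 1. Because the result depends on those two extremes, a single unusually large or small value can compress all the others into a narrow band.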

Apply feature scaling only to numerical features.

About The Author

Dr. Pranay Jha

Dr. Pranay Jha is a Cloud and AI Consultant with 18+ years of experience in hybrid cloud, virtualization, and enterprise infrastructure transformation. He specializes in VMware technologies, multi-cloud strategy, and Generative AI solutions. He holds a PhD in Computer Applications with research focused on Cloud and AI, has published multiple research papers, and has been a VMware vExpert since 2016 and a VMUG Community Leader.
