BUSINESS ANALYTICS (MBA-7302) HPTU SYLLABUS OF MBA 3RD SEM
⭐ UNIT – I (FULL DETAILED NOTES, VERY EASY LANGUAGE)
1️⃣ STATISTICS – BASIC INTRODUCTION
Definition (Easy English):
Statistics is the science of collecting, organizing, presenting, analyzing, and interpreting numerical data for decision making.
Why Statistics is Important in Business?
-
Sales forecasting
-
Market analysis
-
Quality control
-
Inventory management
-
Financial analysis
-
Demand prediction
-
Risk assessment
Types of Statistics
-
Descriptive Statistics – describes data (mean, median, charts).
-
Inferential Statistics – draws conclusions from sample to population (probability, hypothesis, regression).
2️⃣ MEASURES OF CENTRAL TENDENCY (Center of Data)
Yeh batata hai data ka “centre” kahan lie karta hai.
A) MEAN (Arithmetic Mean)
Mean = Data values ka simple average
Formula:
Example (easy):
Data: 5, 10, 15, 20
Mean = (5+10+15+20)/4 = 12.5
Advantages:
✔ Easy to calculate
✔ Uses all data
Disadvantages:
✘ Outliers se effect hota hai
Example:
10, 12, 14, 100 → mean grows too much.
B) MEDIAN
Median = Middle value (jab data ko ascending me arrange karein)
Steps:
-
Arrange data
-
If odd number → middle value
-
If even number → middle 2 values ka average
Example:
Data: 2, 4, 7
Median = 4
Even example:
12, 15, 18, 27
Median = (15+18)/2 = 16.5
Why important?
Median outliers se affect nahi hota → e.g., income distribution.
C) MODE
Most frequently occurring value.
Example:
5, 7, 7, 9, 11
Mode = 7
Use:
✔ shoe size
✔ customer buying pattern
✔ product color preference
3️⃣ MEASURES OF DISPERSION (Data Spread)
Center ke alawa hume yeh bhi jaana hota hai ki data kitna spread/variability show karta hai.
A) RANGE
Very basic measure.
Example:
Prices: 100, 150, 200
Range = 200 – 100 = 100
B) QUARTILE DEVIATION (Optional – extra)
Middle 50% data ka spread.
Example:
Q3 = 80, Q1 = 60
QD = (80 – 60)/2 = 10
C) MEAN DEVIATION (MD)
Deviation from mean/median ka average.
Example:
Data: 4, 6, 8
Mean = 6
|4-6|=2, |6-6|=0, |8-6|=2
MD = (2+0+2)/3 = 1.33
D) VARIANCE & STANDARD DEVIATION (Most Important)
Variance:
Data mean se on average kitna far hai.
Standard Deviation (SD):
Variance ka square root
Example:
Data: 2, 4, 6
Mean = 4
(2–4)² = 4
(4–4)² = 0
(6–4)² = 4
Variance = (4+0+4)/3 = 2.66
SD = √2.66 = 1.63
Importance:
✔ Best measure of reliability
✔ Used in finance (risk), inventory, sales fluctuations
4️⃣ MEASURES OF SHAPE (Skewness)
Data symmetric hai ya tilted?
A) Skewness
Skewness tells:
-
data right side tilt hai?
-
data left side tilt hai?
Types:
1. Positive (Right) Skewness
Tail right side → long right tail
Mean > Median > Mode
Example: Income data
(bohot log low income, few very high)
2. Negative (Left) Skewness
Tail left side → long left tail
Mode > Median > Mean
3. Zero Skewness (Symmetric)
Mean = Median = Mode
Example: Normal distribution (bell curve)
5️⃣ KURTOSIS (Peak of Data)
Definition:
Kurtosis shows data curve kitni peaked ya flat hai.
Types:
-
Leptokurtic → sharp peak (more concentrated)
-
Mesokurtic → normal peak
-
Platykurtic → flat peak (more spread)
6️⃣ CHEBYSHEV’S THEOREM (VERY IMPORTANT)
Chebyshev’s rule applies to all types of data (normal or not normal).
Formula:
Remember Values:
| k (SD) | At least within Mean ± k SD |
|---|---|
| 2 SD | 75% |
| 3 SD | 88.9% ≈ 89% |
| 4 SD | 93.75% |
Example:
If mean = 50 and SD = 10
k = 2
Range = 50 ± 2×10 = 50 ± 20 = 30 to 70
At least 75% data will lie in this range.
7️⃣ Additional Concepts (Exam me puchte hain)
Population vs Sample
Population:
Entire group of interest
→ example: all customers
Sample:
Part of population
→ example: 100 customers out of 10,000
Parameter vs Statistic
● Parameter → population ka measure (mean = μ)
● Statistic → sample ka measure (mean = X̄)
Primary vs Secondary Data
Primary data: collected firsthand
Example: surveys, interviews
Secondary: already available
Example: newspaper, report, website
⭐ UNIT–I Summary Table (Exam Writing Ready)
| Topic | Meaning | Best For | Example |
|---|---|---|---|
| Mean | Average | numerical data | sales avg |
| Median | Mid value | skewed data | income data |
| Mode | Most frequent | categorical | shoe size |
| Range | Max–Min | quick spread | 100–10 |
| Variance | Avg squared deviation | accuracy | finance risk |
| SD | Square root of variance | reliability | price fluctuation |
| Skewness | Tilt | distribution shape | right skew incomes |
| Kurtosis | Peak | shape | leptokurtic peak |
| Chebyshev | Min % within SD | non-normal data | 75% within 2S |
⭐ UNIT – II : PROBABILITY & PROBABILITY DISTRIBUTIONS (FULL DETAILED NOTES)
1️⃣ PROBABILITY – BASIC INTRODUCTION
Definition (Easy):
Probability tells the likelihood/chance of an event happening.
Values lie between 0 and 1.
-
0 → Impossible event
-
1 → Certain event
Example:
Chance of getting head in coin toss = ½ = 0.5
2️⃣ IMPORTANT TERMS
Experiment:
Any action which produces outcomes.
E.g., tossing a coin.
Sample Space (S):
All possible outcomes.
Coin → S = {H, T}
Event (E):
Subset of sample space.
Event = getting head = {H}
3️⃣ TYPES OF PROBABILITY
A) Classical Probability (Theoretical)
Formula:
Example:
Rolling a die → probability of 4 = 1/6.
B) Empirical Probability (Experimental)
Based on observation.
Example: Cricket match mein “win percentage”.
C) Subjective Probability
Based on judgment/experience.
Example: Market trend prediction.
⭐ 4️⃣ LAWS OF PROBABILITY
A) ADDITION LAW (Union of Events)
1) For Mutually Exclusive Events
Events that cannot occur together.
Example: Getting head and tail at same time.
Formula:
2) For Non-Mutually Exclusive Events
Example: Rolling a die → event A: even number, event B: number > 3.
Formula:
B) MULTIPLICATION LAW (Intersection of Events)
1) Independent Events
One event does not affect other.
Example: Coin + die.
2) Dependent Events
Example: drawing cards without replacement.
⭐ 5️⃣ BAYES' THEOREM (Very Important)
Used when:
● Reverse probability chaiye
● Condition-based questions
Formula:
Example (easy):
A machine A makes 60% items, B makes 40%.
Defective rate: A → 2%, B → 3%.
Find: Probability defective item came from machine A.
(Ye exam me bohot common hai.)
⭐ 6️⃣ PROBABILITY DISTRIBUTIONS (Discrete & Continuous)
Unit-II me 3 main distributions hain:
✔ Binomial
✔ Poisson
✔ Normal
A) BINOMIAL DISTRIBUTION
Situation jahan:
-
experiment repeat hota hai
-
2 possible outcomes (success/failure)
-
probability = constant
-
trials independent
Example:
Coin toss 10 times
Success = head
p = 0.5
Formula:
where
p = success probability
q = 1 − p
n = trials
Example:
Probability of getting exactly 3 heads in 5 tosses
n = 5, k = 3, p = 0.5
B) POISSON DISTRIBUTION
Used when rare events occur.
Examples:
✔ No. of road accidents per day
✔ No. of telephone calls per minute
✔ No. of defects in cloth
Formula:
where λ = mean = variance
Example:
λ = 2 (mean calls per minute = 2)
Find probability of 0 calls:
⭐ Difference: Binomial vs Poisson
| Binomial | Poisson |
|---|---|
| Two outcomes | Rare event count |
| Fixed n | No fixed n |
| p constant | p very small |
| k successes in n trials | k occurrences in interval |
C) NORMAL DISTRIBUTION (MOST IMPORTANT)
Also called Gaussian distribution.
Characteristics:
✔ Bell-shaped curve
✔ Symmetric
✔ Mean = Median = Mode
✔ Total area = 1
✔ 68–95–99.7 Rule
⭐ Empirical Rule (VERY IMPORTANT)
For normal distribution:
| Range | Percentage |
|---|---|
| Mean ± 1 SD | 68% |
| Mean ± 2 SD | 95% |
| Mean ± 3 SD | 99.7% |
Example:
Mean = 50, SD = 10
95% values lie between
50 ± 20 → 30 to 70
⭐ Z-SCORE (STANDARD SCORE)
Converts normal data into standard normal form.
Example:
X = 70, mean = 50, SD = 10
⭐ 7️⃣ Comparison Table (Exam ke liye best)
| Distribution | Type | When used | Example |
|---|---|---|---|
| Binomial | Discrete | Success/failure | Heads in 10 tosses |
| Poisson | Discrete | Rare events | Accidents/day |
| Normal | Continuous | Natural data | Height, weight |
⭐ 8️⃣ UNIT–II Ready-Made 5 Mark Answer Structure
Q. Explain Bayes' Theorem.
Intro:
Bayes’ theorem is used to find reverse or conditional probabilities.
Formula:
Example: (Write machine example)
Conclusion:
It is widely used in business analytics, machine learning, medical testing, and classification.
⭐ UNIT – III : CORRELATION & REGRESSION (FULL NOTES)
1️⃣ CORRELATION ANALYSIS
Correlation measures:
✔ two variables ke beech strength kitni hai
✔ direction (positive/negative)
✔ linear relationship
Easy Definition:
Correlation tells how strongly two variables move together.
Example:
-
Height ↑ → Weight ↑ (positive correlation)
-
Price ↑ → Demand ↓ (negative correlation)
⭐ Types of Correlation
1) Positive Correlation
Both variables increase or decrease together.
Examples:
✔ income & expenditure
✔ length & weight
2) Negative Correlation
One increases, the other decreases.
Examples:
✔ price & demand
✔ speed & time taken
3) Zero Correlation
No relationship.
Example: height & intelligence
⭐ Methods of Correlation (Syllabus ke according 2 methods)
✔ Rank Correlation
✔ Karl Pearson’s Coefficient of Correlation
2️⃣ Rank Correlation (Spearman’s Rank Correlation)
Used when:
✔ data ranked form me ho
✔ small samples
✔ qualitative data (performance, preference)
Formula:
where:
d = difference of ranks
n = number of observations
Example (Easy):
Students rank in Maths & English:
| Student | Maths Rank | English Rank |
|---|---|---|
| A | 1 | 2 |
| B | 2 | 1 |
| C | 3 | 3 |
d = difference
A → 1 - 2 = -1 → d² = 1
B → 2 - 1 = 1 → d² = 1
C → 3 - 3 = 0 → d² = 0
Total = 2
→ Moderate positive correlation.
3️⃣ KARL PEARSON’S COEFFICIENT OF CORRELATION (r)
Used when:
✔ data in numeric form
✔ finding linear correlation
Formula:
But exam me shortcut formula likho:
⭐ Interpretation of r
| Value of r | Interpretation |
|---|---|
| +1 | Perfect positive |
| 0.7 to 0.9 | High positive |
| 0.3 to 0.6 | Moderate positive |
| 0 to 0.3 | Low positive |
| 0 | No correlation |
| –0.3 to –0.6 | Moderate negative |
| –0.7 to –1 | High/Perfect negative |
⭐ Properties of Correlation
-
r lies between –1 and +1
-
r has no units
-
r is symmetric ⇒ r(X,Y) = r(Y,X)
-
r only measures linear relationship
-
If r = 0 ⇒ no linear relation
⭐ Difference Between Rank & Pearson Correlation
| Rank Correlation | Pearson Correlation |
|---|---|
| Uses ranks | Uses actual values |
| For qualitative data | For quantitative data |
| Based on differences in ranks | Based on covariance |
| Less accurate | More accurate |
| Non-parametric | Parametric |
4️⃣ REGRESSION ANALYSIS
Regression tells:
✔ effect of one variable on another
✔ prediction/estimation
✔ direction & magnitude of impact
Easy Definition:
Regression is a statistical method for predicting the value of one variable using another variable.
Example:
Predicting sales from advertisement expenditure.
⭐ Types of Regression (Unit syllabus – simple linear regression)
-
Regression of Y on X
-
Regression of X on Y
Regression Line of Y on X
Formula:
Where:
a = intercept
b = slope (regression coefficient)
b (regression coefficient)
a
Regression Line of X on Y
Formula:
where
⭐ Interpretation of Regression Coefficient (b)
b tells:
-
change in dependent variable due to 1 unit change in independent variable.
Example:
b = 0.8
→ X increases by 1 → Y increases by 0.8
⭐ Properties of Regression Coefficients
-
Both regression coefficients generally have same sign as correlation
-
If r = 0 → both regression coefficients = 0
-
b(yx) × b(xy) = r²
-
Regression coefficients are not symmetric
-
Value can be greater than 1
-
Averages always lie on regression line
⭐ Relationship Between Correlation & Regression
If both b positive ⇒ r positive
If both negative ⇒ r negative
Important:
Regression can exist even when correlation is weak, but prediction accuracy low hogi.
⭐ 5️⃣ Solved Example of Regression (Very Easy)
Given data:
| X | Y |
|---|---|
| 2 | 4 |
| 4 | 6 |
| 6 | 8 |
Step 1: Calculate sums
∑X = 12
∑Y = 18
∑XY = (2×4)+(4×6)+(6×8)=8+24+48=80
∑X² = 4+16+36 = 56
n = 3
Step 2: Find b (slope)
Step 3: Find a
Regression line of Y on X:
Means:
X me har 1 unit increase → Y me 1 unit increase.
⭐ 6️⃣ Difference: Correlation vs Regression
| Correlation | Regression |
|---|---|
| Relationship shows | Prediction shows |
| No direction | One-way direction |
| r is unitless | regression has units |
| r between –1 to +1 | b any value |
| Shows strength | Shows rate of change |
⭐ Unit–III Summary (Exam time 2 min revision)
-
Correlation → degree of relationship
-
Pearson r → numerical method (–1 to +1)
-
Rank correlation → qualitative data
-
Regression → prediction + equation
-
Two equations: Y on X, X on Y
-
Regression coefficients related by r² = bxy × byx
UNIT 4 FULL NOTES
(Super Easy + Examples + Exam Oriented)
⭐ PART–A : LINEAR PROGRAMMING (LPP)
1. Linear Programming – Meaning
Definition (Easy English):
Linear Programming is a mathematical technique used to find the best possible solution (maximum profit or minimum cost) under given constraints.
Key Elements:
-
Objective function – Maximize profit OR minimize cost
-
Decision variables – What to produce (x, y etc.)
-
Constraints – Limitations (resources, time, budget)
-
Non-negativity – x, y ≥ 0
⭐ 2. LPP Formulation (How to write LPP)
Steps:
-
Identify decision variables → Let x = product A, y = product B
-
Write objective function
-
Max Z = … OR Min Z = …
-
Identify constraints (resource limits)
-
Add non-negativity (x ≥ 0, y ≥ 0)
Identify decision variables → Let x = product A, y = product B
Write objective function
-
Max Z = … OR Min Z = …
Identify constraints (resource limits)
Add non-negativity (x ≥ 0, y ≥ 0)
⭐ 3. Graphical Method (Only for 2 variables)
Steps:
-
Convert constraints into equations
-
Draw lines on graph
-
Identify feasible region
-
Check corner points
-
Put corner points in objective function
-
Highest value = maximum solution || lowest = minimum
Convert constraints into equations
Draw lines on graph
Identify feasible region
Check corner points
Put corner points in objective function
Highest value = maximum solution || lowest = minimum
⭐ 4. Simplex Method (For more than 2 variables)
Definition:
Simplex is an iterative mathematical method to solve complex LPP problems.
Steps (Simple Language):
-
Convert constraints into equalities (add slack variables)
-
Make initial simplex table
-
Identify entering & leaving variables
-
Perform pivot operations
-
Continue until no negative value in Z-row
(Exam में mostly theory पूछते हैं)
⭐ 5. Sensitivity Analysis
Definition:
Checking how optimal solution changes when parameters (cost, resources, coefficients) are changed.
Why needed?
-
Better decision making
-
Risk reduction
-
Future planning
Components:
-
Shadow price
-
Range of optimality
-
Range of feasibility
⭐ 6. Transportation Problem
Objective: Minimize cost of transporting goods from sources to destinations.
Methods:
-
Initial Solution Methods
-
North West Corner
-
Least Cost Method
-
Vogel’s Approximation Method (Best)
-
Optimality Test
-
MODI method
-
Stepping-stone method
Initial Solution Methods
-
North West Corner
-
Least Cost Method
-
Vogel’s Approximation Method (Best)
Optimality Test
-
MODI method
-
Stepping-stone method
⭐ 7. Assignment Problem
Definition: Assigning persons/jobs in such a way that total cost is minimum or total profit is maximum.
Method Used:
👉 Hungarian Method
Applications:
-
Assigning workers to tasks
-
Machine to jobs
-
Salesmen to territories
-
Teachers to classrooms
⭐ PART–B : DATA ANALYTICS
⭐ 1. Introduction to Analytics
Analytics: Process of collecting, cleaning, analyzing data to find patterns and support decisions.
3 Types:
-
Descriptive Analytics – What happened?
-
Predictive Analytics – What will happen?
-
Prescriptive Analytics – What should be done?
(Exam में mostly first two पूछते हैं)
⭐ 2. Sources of Data
-
Internal data: Sales, HR, finance
-
External data: Government reports, websites
-
Primary data: Surveys, interviews
-
Secondary data: Books, newspapers
Internal data: Sales, HR, finance
External data: Government reports, websites
Primary data: Surveys, interviews
Secondary data: Books, newspapers
⭐ 3. Data Quality Issues
-
Missing data
-
Duplicate values
-
Inconsistency
-
Outliers
-
Human errors
-
Incorrect formats
Missing data
Duplicate values
Inconsistency
Outliers
Human errors
Incorrect formats
⭐ 4. Dealing with Missing Data
Methods:
-
Deletion method – Remove missing rows
-
Mean/Median imputation
-
Regression imputation
-
Interpolation
-
Using default values
⭐ 5. Data Classification
Dividing data into categories (classes).
Types:
-
Binary classification – Yes/No
-
Multi-class classification – A/B/C etc.
-
Ordinal classification – Low/Medium/High
-
Nominal classification – Male/Female
⭐ 6. Descriptive Analytics
Definition: Analyzing historical data to understand what happened.
Tools:
-
Mean, median, mode
-
Charts, graphs
-
Pivot tables
-
Summary statistics
⭐ 7. Predictive Analytics
Definition: Using past data to predict future outcomes.
Types:
-
Univariate Analysis
(Prediction using single variable)
– Example: Predict sales using past sales only. -
Multivariate Analysis
(Prediction using multiple variables)
– Example: Predict sales using price + promotions + season.
Techniques:
-
Regression
-
Time series forecasting
-
Machine learning models
⭐ UNIT 4 → SUPER SHORT SUMMARY
LPP
-
Objective → Max/Min
-
Method → Graphical, Simplex
-
Tools → Slack variable, Feasible region
-
Related → Transportation, Assignment
Objective → Max/Min
Method → Graphical, Simplex
Tools → Slack variable, Feasible region
Related → Transportation, Assignment
Data Analytics
-
Types → Descriptive, Predictive
-
Missing data → Mean/median, deletion
-
Classification → Binary, multi-class
-
Predictive → Regression, models
⭐ BUSINESS ANALYTICS (MBA-7302) – COMPLETE NOTES (UNIT 1 to UNIT 4)
(Hinglish + Examples + Super Easy)
➤ UNIT 1 – BASIC STATISTICS
⭐ 1. Measures of Central Tendency
Meaning:
Central tendency = वो value जो पूरी data का “center” represent करती है.
Types:
1. Mean (Average)
Example: 2,4,6 → Mean = 4
2. Median
Middle value after arranging data.
Example: 3,5,7 → Median = 5
3. Mode
Most frequently occurring value
Example: 2,3,3,4 → Mode = 3
⭐ 2. Measures of Dispersion
Dispersion = Data कितना spread है.
Types:
-
Range = Max – Min
-
Quartile Deviation = Q3 – Q1 / 2
-
Mean Deviation
-
Variance
-
Standard Deviation (σ)
⭐ 3. Skewness & Kurtosis
Skewness – Shape of distribution
-
Positive skew → Tail right side
-
Negative skew → Tail left side
-
Zero skew → Symmetrical
Kurtosis – Peakedness of curve
-
Leptokurtic → Tall, sharp
-
Platykurtic → Flat
-
Mesokurtic → Normal curve
⭐ 4. Chebyshev's Theorem
For any distribution:
-
At least 75% values lie within 2 SD
-
At least 89% values lie within 3 SD
➤ UNIT 2 – PROBABILITY & DISTRIBUTIONS
⭐ 1. Probability Theory
Probability = Likelihood of an event
⭐ 2. Addition Law
Mutually exclusive:
Not mutually exclusive:
⭐ 3. Multiplication Law
Independent:
Dependent:
⭐ 4. Bayes Theorem
Used to find reverse probability
⭐ 5. Probability Distributions
(A) Binomial Distribution
Used when:
-
Fixed trials
-
Success/Failure
-
Probability constant
Example: Coin toss.
Formula:
(B) Poisson Distribution
Used for rare events
Example: Accident, calls, arrivals.
Formula:
(C) Normal Distribution
Bell-shaped, symmetric, mean = median = mode.
Uses:
-
Z-score
-
Probability estimates
-
Quality control
➤ UNIT 3 – CORRELATION & REGRESSION
⭐ 1. Correlation
Measures relationship between two variables.
Range: –1 to +1
Types:
-
+1 → Perfect positive
-
–1 → Perfect negative
-
0 → No relation
⭐ 2. Karl Pearson’s Correlation (r)
⭐ 3. Rank Correlation (Spearman)
Used for ranking data.
⭐ 4. Regression Analysis
Predicting value of one variable using another.
Regression Equation
Regression Coefficient (b)
Indicates slope
If b = 2 → X increases 1 unit → Y increases 2 units
Properties:
-
bxy × byx = r²
-
Same sign as correlation
-
One regression line passes through mean point (x̄, ȳ)
➤ UNIT 4 – LINEAR PROGRAMMING + DATA ANALYTICS
⭐ A. LINEAR PROGRAMMING (LPP)
1. Definition:
A method to maximize profit or minimize cost under given constraints.
2. Components
-
Decision variables
-
Objective function
-
Constraints
-
Non-negativity
⭐ 3. Graphical Method
Steps:
-
Convert constraints
-
Draw lines
-
Feasible region
-
Check corner points
-
Highest value = max / lowest = min
⭐ 4. Simplex Method (Theory)
Used for problems with more than 2 variables.
Steps:
-
Add slack variables
-
Form initial table
-
Entering/leaving variable
-
Pivot
-
Continue until optimum reached
⭐ 5. Transportation Problem
Objective: Minimize cost of transporting goods.
Methods:
-
NWC
-
Least Cost
-
VAM (best)
-
MODI (optimality test)
⭐ 6. Assignment Problem
Objective: Minimum cost assignment.
Method: Hungarian Method
⭐ B. DATA ANALYTICS
⭐ 1. Introduction
Analytics = Converting data into insights.
⭐ 2. Types of Analytics
✔ Descriptive
“What happened?”
Uses: charts, mean, median, reports.
✔ Predictive
“What will happen?”
Uses: regression, forecasting.
⭐ 3. Data Sources
-
Internal
-
External
-
Primary
-
Secondary
⭐ 4. Data Quality Issues
-
Missing values
-
Duplicates
-
Inconsistency
-
Outliers
-
Wrong entry
⭐ 5. Handling Missing Data
-
Deletion
-
Mean/median substitution
-
Regression imputation
-
Interpolation
⭐ 6. Data Classification
-
Binary
-
Multi-class
-
Ordinal
-
Nominal
⭐ 7. Predictive Analytics Methods
-
Regression
-
Machine learning models
-
Time series
⭐ 8. Univariate & Multivariate
-
Univariate → one variable
-
Multivariate → multiple variables
⭐ FINAL SUPER SUMMARY
(10 minutes revision)
UNIT 1 – Stats
Central tendency → Mean/Median/Mode
Dispersion → Range, SD, Variance
Skewness → shape
Kurtosis → peak
Chebyshev → 75% in 2SD
UNIT 2 – Probability
Addition law
Multiplication law
Bayes theorem
Distributions:
-
Binomial → success/failure
-
Poisson → rare events
-
Normal → bell curve
UNIT 3 – Correlation & Regression
Correlation (r → –1 to +1)
Rank correlation
Regression: Y = a + bX
Properties: bxy × byx = r²
UNIT 4 – LPP + Analytics
LPP → maximize/minimize
Graphical method
Simplex (steps)
Transportation (VAM, MODI)
Assignment (Hungarian)
Analytics → descriptive, predictive
Missing data handling
Classification
Types → Descriptive, Predictive
Missing data → Mean/median, deletion
Classification → Binary, multi-class
Predictive → Regression, models
⭐ BUSINESS ANALYTICS (MBA-7302) – COMPLETE NOTES (UNIT 1 to UNIT 4)
(Hinglish + Examples + Super Easy)
➤ UNIT 1 – BASIC STATISTICS
⭐ 1. Measures of Central Tendency
Meaning:
Central tendency = वो value जो पूरी data का “center” represent करती है.
Types:
1. Mean (Average)
Example: 2,4,6 → Mean = 4
2. Median
Middle value after arranging data.
Example: 3,5,7 → Median = 5
3. Mode
Most frequently occurring value
Example: 2,3,3,4 → Mode = 3
⭐ 2. Measures of Dispersion
Dispersion = Data कितना spread है.
Types:
-
Range = Max – Min
-
Quartile Deviation = Q3 – Q1 / 2
-
Mean Deviation
-
Variance
-
Standard Deviation (σ)
⭐ 3. Skewness & Kurtosis
Skewness – Shape of distribution
-
Positive skew → Tail right side
-
Negative skew → Tail left side
-
Zero skew → Symmetrical
Kurtosis – Peakedness of curve
-
Leptokurtic → Tall, sharp
-
Platykurtic → Flat
-
Mesokurtic → Normal curve
⭐ 4. Chebyshev's Theorem
For any distribution:
-
At least 75% values lie within 2 SD
-
At least 89% values lie within 3 SD
➤ UNIT 2 – PROBABILITY & DISTRIBUTIONS
⭐ 1. Probability Theory
Probability = Likelihood of an event
⭐ 2. Addition Law
Mutually exclusive:
Not mutually exclusive:
⭐ 3. Multiplication Law
Independent:
Dependent:
⭐ 4. Bayes Theorem
Used to find reverse probability
⭐ 5. Probability Distributions
(A) Binomial Distribution
Used when:
-
Fixed trials
-
Success/Failure
-
Probability constant
Example: Coin toss.
Formula:
(B) Poisson Distribution
Used for rare events
Example: Accident, calls, arrivals.
Formula:
(C) Normal Distribution
Bell-shaped, symmetric, mean = median = mode.
Uses:
-
Z-score
-
Probability estimates
-
Quality control
➤ UNIT 3 – CORRELATION & REGRESSION
⭐ 1. Correlation
Measures relationship between two variables.
Range: –1 to +1
Types:
-
+1 → Perfect positive
-
–1 → Perfect negative
-
0 → No relation
⭐ 2. Karl Pearson’s Correlation (r)
⭐ 3. Rank Correlation (Spearman)
Used for ranking data.
⭐ 4. Regression Analysis
Predicting value of one variable using another.
Regression Equation
Regression Coefficient (b)
Indicates slope
If b = 2 → X increases 1 unit → Y increases 2 units
Properties:
-
bxy × byx = r²
-
Same sign as correlation
-
One regression line passes through mean point (x̄, ȳ)
➤ UNIT 4 – LINEAR PROGRAMMING + DATA ANALYTICS
⭐ A. LINEAR PROGRAMMING (LPP)
1. Definition:
A method to maximize profit or minimize cost under given constraints.
2. Components
-
Decision variables
-
Objective function
-
Constraints
-
Non-negativity
⭐ 3. Graphical Method
Steps:
-
Convert constraints
-
Draw lines
-
Feasible region
-
Check corner points
-
Highest value = max / lowest = min
⭐ 4. Simplex Method (Theory)
Used for problems with more than 2 variables.
Steps:
-
Add slack variables
-
Form initial table
-
Entering/leaving variable
-
Pivot
-
Continue until optimum reached
⭐ 5. Transportation Problem
Objective: Minimize cost of transporting goods.
Methods:
-
NWC
-
Least Cost
-
VAM (best)
-
MODI (optimality test)
⭐ 6. Assignment Problem
Objective: Minimum cost assignment.
Method: Hungarian Method
⭐ B. DATA ANALYTICS
⭐ 1. Introduction
Analytics = Converting data into insights.
⭐ 2. Types of Analytics
✔ Descriptive
“What happened?”
Uses: charts, mean, median, reports.
✔ Predictive
“What will happen?”
Uses: regression, forecasting.
⭐ 3. Data Sources
-
Internal
-
External
-
Primary
-
Secondary
⭐ 4. Data Quality Issues
-
Missing values
-
Duplicates
-
Inconsistency
-
Outliers
-
Wrong entry
⭐ 5. Handling Missing Data
-
Deletion
-
Mean/median substitution
-
Regression imputation
-
Interpolation
⭐ 6. Data Classification
-
Binary
-
Multi-class
-
Ordinal
-
Nominal
⭐ 7. Predictive Analytics Methods
-
Regression
-
Machine learning models
-
Time series
⭐ 8. Univariate & Multivariate
-
Univariate → one variable
-
Multivariate → multiple variables
⭐ FINAL SUPER SUMMARY
(10 minutes revision)
UNIT 1 – Stats
Central tendency → Mean/Median/Mode
Dispersion → Range, SD, Variance
Skewness → shape
Kurtosis → peak
Chebyshev → 75% in 2SD
UNIT 2 – Probability
Addition law
Multiplication law
Bayes theorem
Distributions:
-
Binomial → success/failure
-
Poisson → rare events
-
Normal → bell curve
UNIT 3 – Correlation & Regression
Correlation (r → –1 to +1)
Rank correlation
Regression: Y = a + bX
Properties: bxy × byx = r²
UNIT 4 – LPP + Analytics
LPP → maximize/minimize
Graphical method
Simplex (steps)
Transportation (VAM, MODI)
Assignment (Hungarian)
Analytics → descriptive, predictive
Missing data handling
Classification
Comments
Post a Comment