Frequency Distribution
Definition
Frequency Distribution — Meaning, Definition & Full Explanation
A frequency distribution is a tabular or graphical representation that organizes data into classes or intervals and shows how many observations fall within each class. It displays the pattern of how often values occur across a dataset, making it easier to identify trends, outliers, and the overall shape of the data. Frequency distributions are essential tools in statistical analysis and data interpretation across banking, finance, and risk management.
What is Frequency Distribution?
A frequency distribution organizes raw data into meaningful groups called classes or bins, then counts how many data points belong to each class. The resulting table or chart reveals the pattern and concentration of observations. For example, if you measure the heights of 100 customers visiting a bank branch, a frequency distribution might group heights into 5 cm intervals (150–155 cm, 155–160 cm, etc.) and count how many people fall into each interval.
The key principle underlying any frequency distribution is that classes must be both exhaustive and mutually exclusive. Exhaustive means every data point must fit into some class; mutually exclusive means no data point can belong to more than one class. A frequency distribution often approximates a normal distribution (bell curve), where most observations cluster around the middle value and fewer observations appear at the extremes. Frequency distributions help analysts spot patterns, calculate probabilities, and make data-driven decisions. They reduce large datasets into compact, understandable summaries and form the foundation for calculating other statistics like mean, median, mode, and standard deviation.
Free • Daily Updates
Get 1 Banking Term Every Day on Telegram
Daily vocab cards, RBI policy updates & JAIIB/CAIIB exam tips — trusted by bankers and exam aspirants across India.
How Frequency Distribution Works
The process of creating a frequency distribution follows these steps:
Define the range: Calculate the difference between the highest and lowest values in your dataset. If loan amounts range from ₹50,000 to ₹5,000,000, the range is ₹4,950,000.
Determine class intervals: Decide how many classes you need and the width of each interval. Common practice suggests 5 to 20 classes depending on dataset size. Narrower intervals provide more detail; wider intervals show broader patterns.
Set class boundaries: Establish the lower and upper limits for each class. Ensure no overlap (mutually exclusive). For example: ₹50,000–₹500,000, ₹500,001–₹1,000,000, and so on.
Tally observations: Count how many data points fall into each class. This count is the frequency.
Display the distribution: Present results as a frequency table (showing class and frequency count) or as a histogram, bar chart, or polygon graph.
Variants include:
- Simple frequency distribution: Shows raw count per class
- Relative frequency distribution: Shows percentage or proportion per class
- Cumulative frequency distribution: Running total of frequencies up to each class
- Grouped frequency distribution: Data organized into standard intervals (common in banking for loan amounts, deposit sizes, or transaction values)
Frequency Distribution in Indian Banking
In Indian banking operations, frequency distributions support risk assessment, customer analysis, and regulatory reporting. The Reserve Bank of India (RBI) requires banks to maintain frequency distributions for loan exposure monitoring—particularly for evaluating concentration risk across borrower segments, loan sizes, and industry sectors. Banks use frequency distributions to segment customers by deposit amounts, track non-performing asset (NPA) distributions across loan categories, and analyze transaction patterns for anti-money laundering (AML) compliance.
For instance, scheduled commercial banks regularly prepare frequency distributions of advances by sector, borrower size (micro, small, medium enterprises), and geographic region, as mandated by RBI prudential norms. These distributions help identify lending concentration and ensure compliance with priority sector lending requirements.
In JAIIB examination syllabus, frequency distributions appear under quantitative methods and statistical analysis modules. Candidates studying for JAIIB need to understand how to construct frequency tables, interpret histograms, and apply frequency analysis to banking scenarios such as analyzing customer deposit patterns or loan default rates.
Additionally, frequency distributions support credit risk modeling—banks analyze the frequency distribution of credit scores, loan-to-value ratios, and customer income levels to assess portfolio health and set appropriate risk premiums. Insurance companies use similar distributions when analyzing claim frequencies for actuarial calculations.
Practical Example
Consider Ravi Mehta, a retail credit analyst at HDFC Bank's Bangalore branch. The branch has approved 200 personal loans over the past quarter, ranging from ₹1,00,000 to ₹25,00,000. To understand the lending pattern and identify whether the portfolio is skewed toward small or large loans, Ravi constructs a frequency distribution.
He sets up 5 classes: ₹1,00,000–₹5,00,000 (60 loans), ₹5,00,001–₹10,00,000 (85 loans), ₹10,00,001–₹15,00,000 (35 loans), ₹15,00,001–₹20,00,000 (15 loans), and ₹20,00,001–₹25,00,000 (5 loans). The resulting frequency distribution shows that 72.5% of loans cluster in the first two classes, indicating the branch's strength in mid-ticket personal lending. Ravi then calculates a cumulative frequency to show that 145 loans (72.5%) are ₹10,00,000 or less. This insight helps the branch set realistic approval targets and adjust marketing spend toward high-frequency segments.
Frequency Distribution vs. Cumulative Frequency Distribution
| Aspect | Frequency Distribution | Cumulative Frequency Distribution |
|---|---|---|
| Definition | Shows count of observations in each class | Shows running total of frequencies up to and including each class |
| Calculation | Direct count per interval | Sum of all frequencies up to current interval |
| Graph type | Histogram, bar chart | Ogive (cumulative curve) or step graph |
| Use case | Identifying most common values; seeing distribution shape | Determining percentiles; finding median and quartiles |
A frequency distribution answers "How many observations are in this class?" while a cumulative frequency distribution answers "How many observations are at or below this point?" Both are valuable in banking—use frequency distribution to spot popular loan sizes, and use cumulative frequency to find the 75th percentile of customer savings balances.
Key Takeaways
- A frequency distribution organizes data into mutually exclusive, exhaustive classes and counts observations per class.
- Classes must not overlap (mutually exclusive) and must include all data points (exhaustive) to be valid.
- Frequency distributions form the basis for calculating mean, median, standard deviation, and detecting outliers in banking datasets.
- In Indian banking, the RBI mandates frequency distributions of advances by sector, borrower type, and loan size for regulatory reporting and risk management.
- Cumulative frequency distributions are used to calculate percentiles and quartiles, essential for credit risk assessment and portfolio benchmarking.
- Histograms and ogives are graphical representations of frequency and cumulative frequency distributions respectively.
- Frequency analysis supports JAIIB quantitative methods and is tested in statistical analysis modules.
- Relative frequency distributions convert raw counts to percentages, making cross-bank and cross-time-period comparisons easier.
Frequently Asked Questions
Q: What is the difference between frequency and relative frequency? A: Frequency is the absolute count of observations in a class (e.g., 85 loans). Relative frequency expresses this as a proportion or percentage (e.g., 42.5% of 200 loans). Relative frequency is useful for comparing datasets of different sizes.
Q: Can a frequency distribution have unequal class widths? A: Yes, but unequal class widths are less common and require careful interpretation. When widths differ, use frequency density (frequency ÷ class width) instead of raw frequency for accurate graphical representation, otherwise taller bars may mislead you about true concentration.
Q: How does frequency distribution help in assessing credit risk? A: Banks use frequency distributions to analyze the spread of credit scores, loan sizes, and borrower income levels across a portfolio. A skewed distribution (e.g., most borrowers in low-income segments) signals concentration risk, while a balanced distribution suggests healthy diversification and lower portfolio risk.