Random Variables and Distribution Functions

 

📘 Random Variables – An Overview

🔹 1. Intuition and Example

A random variable (r.v.) is a real-valued function that assigns a number to each outcome of a random experiment.

Example:
Let the experiment be two coin tosses.
Sample Space: S = {HH, HT, TH, TT}
Define a random variable X = number of heads.

Outcome (ω):  HH  HT  TH  TT
X(ω):          2   1   1   0
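
The table above can be reproduced with a short Python sketch, treating X as an ordinary function on the sample space:

```python
from itertools import product

# The random variable X = "number of heads" as a plain function on the
# sample space of two coin tosses.
sample_space = ["".join(w) for w in product("HT", repeat=2)]  # ['HH', 'HT', 'TH', 'TT']

def X(outcome):
    """X(ω) = number of heads in the outcome ω."""
    return outcome.count("H")

for omega in sample_space:
    print(omega, X(omega))  # HH 2, HT 1, TH 1, TT 0
```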

🔹 2. Mathematical Definition

Let (S, B, P) be a probability space:

  • S: Sample space

  • B: σ-field (collection of measurable subsets of S)

  • P: Probability function defined on B

Then:

A random variable X(ω) is a real-valued function defined on S such that, for every real number a, the set

{ω ∈ S : X(ω) ≤ a} ∈ B

This ensures that probabilities like P(X ≤ a) are well-defined.


🔹 3. Types of Random Variables

  • One-dimensional: Real-valued X : S → R

  • Two-dimensional: Vector-valued X : S → R²

  • n-dimensional: X : S → Rⁿ


🔹 4. Measurability

A random variable must be measurable, meaning it maps events in the sample space to real numbers in such a way that all standard probability statements (like P(X ≤ a)) can be interpreted as events in the σ-field B.


🔹 5. Notation and Probability Statements

  • X: Random variable (uppercase letter)

  • x: Value taken by X (lowercase)

  • ω: Sample point in S

Some useful probability notations:

Statement               Meaning
P(X = x)                Probability that the random variable X equals x
P(X ≤ a)                Probability that X ≤ a
P(a < X ≤ b)            Probability that X lies in the interval (a, b]
P(X = a or X = b)       Union of events: P(X = a) + P(X = b) if disjoint
P(X = a and X = b)      Intersection of events (0 if a ≠ b)

🔹 6. Example Interpretation

If in an experiment, X is the number of heads in two tosses:

P(X ≤ 1) = P({HT, TH, TT}) = 3/4

This is just the probability of all outcomes ω such that X(ω) ≤ 1.
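
This computation can be checked by brute-force enumeration, assigning each of the four equally likely outcomes probability 1/4:

```python
from fractions import Fraction

# P(X <= 1) for X = number of heads in two fair tosses, by summing
# over the outcomes that make up the event.
sample_space = ["HH", "HT", "TH", "TT"]
p = Fraction(1, 4)              # each outcome is equally likely

event = [w for w in sample_space if w.count("H") <= 1]
prob = len(event) * p
print(event, prob)              # ['HT', 'TH', 'TT'] 3/4
```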


🎲 Illustration 1: Tossing a Coin

✳️ Experiment: Toss a coin

  • Sample Space:

    S = {ω₁, ω₂} = {H, T}

✳️ Define a Random Variable X(ω):

X(ω) = 1, if ω = H (Head); X(ω) = 0, if ω = T (Tail)

  • This is a Bernoulli random variable, since it only takes two values: 0 and 1.

  • It is an example of a discrete and finite-valued random variable.


🎲 Illustration 2: Rolling a Die

✳️ Experiment: Roll a fair six-sided die

  • Sample Space:

    S = {1, 2, 3, 4, 5, 6}

✳️ Define Random Variable X(ω):

Let X(ω) = ω, i.e., the number appearing on the die.

So,

  • X(1) = 1

  • X(2) = 2

  • ...

  • X(6) = 6

✳️ Alternative Random Variable Y(ω):

Suppose we are only interested in whether the die roll is even or odd.

Then define:

Y(ω) = 0, if ω is even (2, 4, 6); Y(ω) = 1, if ω is odd (1, 3, 5)

So,

  • Y(2) = Y(4) = Y(6) = 0

  • Y(1) = Y(3) = Y(5) = 1

This again is a Bernoulli-type random variable since it takes only two values (0 and 1), representing a classification (even vs. odd).
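
The classification variable Y can be written as a small Python function; a minimal sketch:

```python
# Y(ω) = 0 if the face ω is even, 1 if it is odd.
def Y(omega):
    return 0 if omega % 2 == 0 else 1

values = {omega: Y(omega) for omega in range(1, 7)}
print(values)  # {1: 1, 2: 0, 3: 1, 4: 0, 5: 1, 6: 0}
```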

📚 Theorems on Random Variables (without Proof)


🔸 Theorem 5.1 – Measurability

A function X(ω) from a sample space S to the real line R = (−∞, ∞) is a random variable if and only if the set {ω ∈ S : X(ω) < a}

belongs to the σ-algebra B (i.e., is a measurable event) for all real numbers a.

🔸 Theorem 5.2 – Algebra of Random Variables

If X₁ and X₂ are random variables and C is a constant, then:

CX₁ (scalar multiple)
X₁ + X₂ (sum)
X₁X₂ (product)

are also random variables.

Remark:
It follows that any linear combination like C₁X₁ + C₂X₂ is also a random variable.
In particular, X₁ − X₂ is a random variable.


🔸 Theorem 5.3 – Supremum, Infimum, Limits

If {Xₙ(ω), n ≥ 1} is a sequence of random variables, then:

supₙ Xₙ(ω),
infₙ Xₙ(ω),
lim supₙ→∞ Xₙ(ω),
lim infₙ→∞ Xₙ(ω)

are all random variables, provided they are finite for all ω.

🔸 Theorem 5.4 – Positive and Negative Parts, Absolute Value

If X is a random variable, then the following are also random variables:

    1/X(ω), if X(ω) ≠ 0
    X⁺(ω) = max(0, X(ω)) – positive part
    X⁻(ω) = −min(0, X(ω)) – negative part
    |X(ω)|

🔸 Theorem 5.5 – Maximum and Minimum

If X₁ and X₂ are random variables, then:

max(X₁, X₂)
min(X₁, X₂)

are also random variables.

🔸 Theorem 5.6 – Function of a Random Variable


If X is a random variable and f(·) is a continuous function, then f(X) is a random variable.

🔸 Theorem 5.7 – Monotonic Function

If X is a random variable and f(·) is an increasing function, then f(X) is a random variable.

Corollary:
If f is a function of bounded variation on every finite interval [a, b], and X is a random variable, then f(X) is also a random variable.

📈 Distribution Function of a Random Variable

Let X be a random variable defined on a probability space (S, B, P). The distribution function (also called the cumulative distribution function or cdf) of X is denoted by:

F_X(x) = P(X ≤ x) = P(ω ∈ S : X(ω) ≤ x),  −∞ < x < ∞

✅ Interpretation:

  • F_X(x) gives the probability that the random variable X takes a value less than or equal to x.

  • It accumulates probabilities up to the value x.


📌 Properties of a Distribution Function:

  1. Non-decreasing:
    If a < b, then F_X(a) ≤ F_X(b)

  2. Right-continuous:

    lim_{h→0⁺} F_X(x + h) = F_X(x)

  3. Limits:

    lim_{x→−∞} F_X(x) = 0,  lim_{x→∞} F_X(x) = 1

  4. Bounded between 0 and 1:

    0 ≤ F_X(x) ≤ 1

  5. Probability between two points:
    If a < b, then:

    P(a < X ≤ b) = F(b) − F(a)

    📌 Explanation:
    The cdf accumulates the probability up to a value. So, the probability that X lies between a and b (strictly greater than a, less than or equal to b) is just the difference in the cumulative probabilities at those two points.
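
The identity P(a < X ≤ b) = F(b) − F(a) is easy to verify numerically. A sketch for a fair die, whose cdf is a step function:

```python
from fractions import Fraction

# cdf of a fair six-sided die, and the identity P(a < X <= b) = F(b) - F(a).
pmf = {k: Fraction(1, 6) for k in range(1, 7)}

def F(x):
    """F(x) = P(X <= x): sum of the pmf over values not exceeding x."""
    return sum((p for k, p in pmf.items() if k <= x), Fraction(0))

# P(2 < X <= 5), computed directly and via the cdf difference.
direct = sum((p for k, p in pmf.items() if 2 < k <= 5), Fraction(0))
print(direct, F(5) - F(2))  # 1/2 1/2
```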

📘 Discrete Random Variable

A discrete random variable (r.v.) is a random variable that takes at most a countable number of distinct values.

👉 In simpler terms:

  • A function defined on a discrete sample space (like outcomes of tossing a coin or rolling a die).

  • Values can be finite (e.g., 0, 1, 2) or countably infinite (like 1, 2, 3, ...).


🎯 Probability Mass Function (PMF)

If X is a discrete random variable that takes values x₁, x₂, x₃, …, then the probability mass function (p.m.f.) is:

p(xᵢ) = P(X = xᵢ)

✅ Conditions for a PMF:

  1. Non-negativity:

    p(xᵢ) ≥ 0 for all i

  2. Total probability is 1:

    Σᵢ p(xᵢ) = 1

📌 The collection {(xᵢ, p(xᵢ))} is called the probability distribution of X.


🧠 Remark:

  • The set of values taken by X is called the spectrum or support of the random variable.

  • For any event E ⊂ R, the probability:

    P(X ∈ E) = Σ_{xᵢ ∈ E} p(xᵢ)
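
The two p.m.f. conditions and the event formula can be checked mechanically. A sketch with a hypothetical pmf on {0, 1, 2, 3} (the number of heads in three fair tosses):

```python
from fractions import Fraction

# A hypothetical pmf: number of heads in three fair coin tosses.
pmf = {0: Fraction(1, 8), 1: Fraction(3, 8), 2: Fraction(3, 8), 3: Fraction(1, 8)}

assert all(p >= 0 for p in pmf.values())  # non-negativity
assert sum(pmf.values()) == 1             # total probability is 1

E = {1, 3}                                # an event E ⊂ R
prob_E = sum(pmf[x] for x in pmf if x in E)
print(prob_E)                             # 1/2
```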

✅ Example: Tossing a Fair Coin

Sample space:

S = {H, T}

Define random variable X as:

  • X(H) = 1

  • X(T) = 0

This means:

Outcome    X    Probability
H          1    P(X = 1) = 1/2
T          0    P(X = 0) = 1/2

🧾 This is a Bernoulli random variable.


Example Problem:

A random variable X has the following probability function:

x    :  −2    −1    0     1     2     3
p(x) :  0.1   k     0.2   2k    0.3   k

Find the value of k, and calculate the mean and variance.

Given values of X:

x = −2, −1, 0, 1, 2, 3

Corresponding probabilities p(x):

p(x) = 0.1, k, 0.2, 2k, 0.3, k

🔹 (i) Find the value of k

Since the total probability must equal 1, we write:

0.1 + k + 0.2 + 2k + 0.3 + k = 1

Combine like terms:

(0.1 + 0.2 + 0.3) + (k + 2k + k) = 1 ⇒ 0.6 + 4k = 1 ⇒ 4k = 1 − 0.6 = 0.4 ⇒ k = 0.4/4 = 0.1

✅ So, k = 0.1


🔹 Now calculate Mean E(X):

E(X) = Σ x·p(x)

x     p(x)    x·p(x)
−2    0.1     −0.2
−1    0.1     −0.1
0     0.2     0
1     0.2     0.2
2     0.3     0.6
3     0.1     0.3

E(X) = −0.2 − 0.1 + 0 + 0.2 + 0.6 + 0.3 = 0.8

Mean E(X) = 0.8


🔹 Now calculate Variance Var(X):

We need:

Var(X) = E(X²) − [E(X)]²

First calculate E(X²) = Σ x²·p(x):

x     x²    p(x)    x²·p(x)
−2    4     0.1     0.4
−1    1     0.1     0.1
0     0     0.2     0
1     1     0.2     0.2
2     4     0.3     1.2
3     9     0.1     0.9

E(X²) = 0.4 + 0.1 + 0 + 0.2 + 1.2 + 0.9 = 2.8

Var(X) = E(X²) − [E(X)]² = 2.8 − (0.8)² = 2.8 − 0.64 = 2.16

Variance Var(X) = 2.16


✅ Final Answers:

  • Value of k: 0.1

  • Mean E(X): 0.8

  • Variance Var(X): 2.16
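
The whole worked example can be verified in a few lines:

```python
# Verify k, E(X) and Var(X) for the pmf in the example above.
xs = [-2, -1, 0, 1, 2, 3]
k = 0.1
ps = [0.1, k, 0.2, 2 * k, 0.3, k]

total = sum(ps)
mean = sum(x * p for x, p in zip(xs, ps))
var = sum(x**2 * p for x, p in zip(xs, ps)) - mean**2
print(round(total, 10), round(mean, 10), round(var, 10))  # 1.0 0.8 2.16
```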

Example-2

Given the probability function:

x    :  0     1     2     3
p(x) :  0.1   0.3   0.5   0.1

Let Y = X² + 2X, then find
(i) the probability function of Y
(ii) mean and variance of Y


Given:

Random variable X with the following probability function:

X       0      1      2      3
P(X)    0.1    0.3    0.5    0.1

You are asked to define a new random variable:
Y = X² + 2X

(i) Find the probability function of Y:

We'll compute Y for each value of X:

X    P(X)    Y = X² + 2X
0    0.1     0² + 2×0 = 0
1    0.3     1² + 2×1 = 3
2    0.5     4 + 4 = 8
3    0.1     9 + 6 = 15

Now we define the probability function of Y:

Y     P(Y)
0     0.1
3     0.3
8     0.5
15    0.1

(ii) Mean and Variance of Y:

🔹 Mean: E(Y) = Σ Y·P(Y)

E(Y) = (0)(0.1) + (3)(0.3) + (8)(0.5) + (15)(0.1) = 0 + 0.9 + 4.0 + 1.5 = 6.4

🔹 Variance:

Var(Y) = E(Y²) − [E(Y)]²

We compute Y²·P(Y):

Y     Y²     Y²·P(Y)
0     0      0 × 0.1 = 0.0
3     9      9 × 0.3 = 2.7
8     64     64 × 0.5 = 32.0
15    225    225 × 0.1 = 22.5

E(Y²) = 0 + 2.7 + 32.0 + 22.5 = 57.2

Var(Y) = 57.2 − (6.4)² = 57.2 − 40.96 = 16.24

Final Answers:

  • (i) Probability function of Y:

    Y     P(Y)
    0     0.1
    3     0.3
    8     0.5
    15    0.1

  • (ii) Mean of Y: E(Y) = 6.4

  • Variance of Y: Var(Y) = 16.24
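
The transformation Y = X² + 2X can also be carried out programmatically; accumulating probabilities covers the general case where several x-values map to the same y:

```python
# Derive the pmf of Y = X^2 + 2X from the pmf of X, then E(Y) and Var(Y).
pmf_X = {0: 0.1, 1: 0.3, 2: 0.5, 3: 0.1}

pmf_Y = {}
for x, p in pmf_X.items():
    y = x**2 + 2 * x
    pmf_Y[y] = pmf_Y.get(y, 0) + p  # accumulate in case of collisions

mean = sum(y * p for y, p in pmf_Y.items())
var = sum(y**2 * p for y, p in pmf_Y.items()) - mean**2
print(pmf_Y)  # {0: 0.1, 3: 0.3, 8: 0.5, 15: 0.1}
print(round(mean, 10), round(var, 10))
```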

Continuous Random Variable

A random variable X is said to be continuous if it can take any possible value within a certain interval. In other words:

A continuous random variable is one whose possible values cannot be listed individually, because they form an uncountably infinite set — such as all real numbers between two limits.

This means its values cannot be put into one-to-one correspondence with positive integers (unlike discrete variables).


Key Characteristics:

  • Can assume infinitely many values in a given range.

  • Is measurable to any desired level of precision.

  • The probability at any exact point is zero, i.e.,

    P(X = a) = 0

  • Instead, probability is defined over an interval, using a Probability Density Function (PDF):

    P(a ≤ X ≤ b) = ∫_a^b f(x) dx

    where f(x) ≥ 0 and

    ∫_{−∞}^{∞} f(x) dx = 1

Examples of Continuous Random Variables:

Variable       Example values
Age            20.51 years, 20.512 years, etc.
Height         172.3 cm, 172.345 cm, etc.
Weight         64.2 kg, 64.2356 kg, etc.
Temperature    36.5°C, 36.5021°C, etc.
Time           10.5 seconds, 10.51 seconds, etc.

Comparison with Discrete Random Variables

Feature                       Discrete Variable                    Continuous Variable
Value type                    Countable (1, 2, 3, …)               Uncountable (real values)
Probability distribution      Probability Mass Function (PMF)      Probability Density Function (PDF)
Example                       Number of students, toss of a die    Height, weight, age
Probability at exact value    > 0                                  = 0

Probability Density Function (PDF) – Concept and Definition

A Probability Density Function (PDF) is used to describe the probability distribution of a continuous random variable.

Let X be a continuous random variable, and let f(x) be a continuous function defined over the domain of X. Then:

P(x ≤ X ≤ x + dx) = f(x)·dx

This means that the probability that the random variable X takes a value within the small interval (x, x + dx) is approximately:

Probability ≈ f(x) dx

Here,

  • f(x) is the probability density function,

  • dx is a very small (infinitesimal) interval,

  • f(x)·dx gives the probability that X lies in the interval (x, x + dx).


Key Properties of PDF

  1. Non-negativity:

    f(x) ≥ 0 for all real x

  2. Total area under the curve is 1:

    ∫_{−∞}^{∞} f(x) dx = 1

    This expresses that the total probability is 1.

  3. Probability over an interval:

    P(a ≤ X ≤ b) = ∫_a^b f(x) dx

  4. Probability at an exact value is zero:

    P(X = x) = 0

    Because the width of the interval is zero, the probability is zero.


Example

If the PDF of a continuous random variable X is:

f(x) = 2x, if 0 ≤ x ≤ 1; f(x) = 0, otherwise

Then,

  • The probability that X lies between 0.2 and 0.5 is:

    P(0.2 ≤ X ≤ 0.5) = ∫_{0.2}^{0.5} 2x dx = [x²]_{0.2}^{0.5} = 0.25 − 0.04 = 0.21

[Graph: the PDF f(x) = 2x for 0 ≤ x ≤ 1, with the shaded area over [0.2, 0.5] showing the probability 0.21 calculated above.]
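
The 0.21 result, and the total-area condition, can be checked with a simple midpoint-rule quadrature (a numerical sketch, no external libraries needed):

```python
# Midpoint-rule approximation of the integral of f over [a, b].
def integrate(f, a, b, n=100_000):
    h = (b - a) / n
    return sum(f(a + (i + 0.5) * h) for i in range(n)) * h

f = lambda x: 2 * x
prob = integrate(f, 0.2, 0.5)   # P(0.2 <= X <= 0.5)
total = integrate(f, 0.0, 1.0)  # must be 1
print(round(prob, 6), round(total, 6))  # 0.21 1.0
```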

📌 Important Remark: Probability at a Point for Continuous Random Variables

✅ For Discrete Random Variables:

  • A discrete random variable X can take on specific values like x = 1, 2, 3, etc.

  • The probability at a particular point is not zero:

    P(X = c) ≠ 0 for some c

  • For example, if P(X = 2) = 0.4, that means there's a 40% chance that X = 2.


✅ For Continuous Random Variables:

  • A continuous random variable can take infinitely many values over an interval (like time, height, weight).

  • Probability at a single point is always zero:

    P(X = c) = 0 for all c

    This is because the total probability (area under the curve) is spread over an interval, and the area at a single point is zero.


🔁 This Leads to an Important Equality:

For continuous random variables, whether you include or exclude the endpoints of an interval does not matter:

P(a ≤ X ≤ b) = P(a < X ≤ b) = P(a ≤ X < b) = P(a < X < b)

This is valid because:

P(X = a) = 0 and P(X = b) = 0

So, interval types (open, closed, half-open) are interchangeable for probability purposes in continuous distributions.

📘 Measures for Continuous Probability Distributions

Let f(x) be a probability density function (p.d.f.) defined on the interval [a, b].

1. Arithmetic Mean (Expected Value)

μ = ∫_a^b x·f(x) dx


2. Harmonic Mean (H)

1/H = ∫_a^b (1/x)·f(x) dx


3. Geometric Mean (G)

log G = ∫_a^b log x·f(x) dx  ⇒  G = e^{∫ log x·f(x) dx}


4. Moments

(a) r-th moment about origin:

μ_r′ = ∫_a^b x^r·f(x) dx

(b) r-th moment about a point A:

μ_r(A) = ∫_a^b (x − A)^r·f(x) dx

(c) Central moments (about mean):

μ_r = ∫_a^b (x − μ)^r·f(x) dx

Specifically:

  • Variance:

σ² = μ₂ = ∫_a^b (x − μ)²·f(x) dx = μ₂′ − μ²


5. Skewness and Kurtosis

  • Skewness: γ₁ = μ₃/σ³

  • Kurtosis: β₂ = μ₄/σ⁴ (excess kurtosis γ₂ = β₂ − 3)

You can compute μ₃, μ₄ using:

μ₃ = μ₃′ − 3μ₁′μ₂′ + 2μ₁′³
μ₄ = μ₄′ − 4μ₃′μ₁′ + 6μ₂′μ₁′² − 3μ₁′⁴


6. Median (M)

The value of M satisfies:

∫_a^M f(x) dx = 1/2


7. Mean Deviation (about mean μ)

M.D. = ∫_a^b |x − μ|·f(x) dx


8. Quartiles and Deciles

  • First Quartile Q₁:

∫_a^{Q₁} f(x) dx = 1/4

  • Third Quartile Q₃:

∫_a^{Q₃} f(x) dx = 3/4

  • i-th Decile Dᵢ:

∫_a^{Dᵢ} f(x) dx = i/10


9. Mode

  • The value of x where the density f(x) is maximum:

f′(x) = 0 and f″(x) < 0
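
As a worked check, these formulas applied to the density f(x) = 2x on [0, 1] (from the earlier example) give mean 2/3, variance 1/18 and median 1/√2; a numerical sketch using a simple midpoint-rule quadrature:

```python
# Midpoint-rule quadrature, accurate enough for these smooth integrands.
def integrate(f, a, b, n=10_000):
    h = (b - a) / n
    return sum(f(a + (i + 0.5) * h) for i in range(n)) * h

pdf = lambda x: 2 * x                            # density on [0, 1]
mean = integrate(lambda x: x * pdf(x), 0, 1)
var = integrate(lambda x: (x - mean) ** 2 * pdf(x), 0, 1)

# Median: solve the integral of f from 0 to M equals 1/2, by bisection.
lo, hi = 0.0, 1.0
for _ in range(50):
    mid = (lo + hi) / 2
    if integrate(pdf, 0, mid) < 0.5:
        lo = mid
    else:
        hi = mid
median = (lo + hi) / 2
print(round(mean, 4), round(var, 5), round(median, 4))  # 0.6667 0.05556 0.7071
```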

Example Problem-1

  • A continuous random variable X has p.d.f.

    f(x) = A + Bx for 0 ≤ x ≤ 1

  • The mean of the distribution is 1.

Find the constants A and B.


✅ Step 1: Use the property that total probability = 1

∫_0^1 f(x) dx = 1 ⇒ ∫_0^1 (A + Bx) dx = 1

[Ax + (B/2)x²]_0^1 = A + B/2 = 1  (Equation 1)


✅ Step 2: Use the mean formula

Mean = ∫_0^1 x·f(x) dx = 1 ⇒ ∫_0^1 x(A + Bx) dx = 1

∫_0^1 (Ax + Bx²) dx = [(A/2)x² + (B/3)x³]_0^1 = A/2 + B/3 = 1  (Equation 2)


✅ Step 3: Solve the two equations

From Equation 1:

A + B/2 = 1 ⇒ A = 1 − B/2  (Equation 3)

Substitute into Equation 2:

(1 − B/2)/2 + B/3 = 1 ⇒ 1/2 − B/4 + B/3 = 1

⇒ −B/4 + B/3 = 1/2 ⇒ B(−1/4 + 1/3) = 1/2 ⇒ B·(1/12) = 1/2 ⇒ B = 6

Substitute back into Equation 3:

A = 1 − 6/2 = 1 − 3 = −2


✅ Final Answer:

A = −2,  B = 6
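
The result can be sanity-checked numerically: f(x) = −2 + 6x integrates to 1 and has mean 1 on [0, 1]. (Note that this f is negative near x = 0, so strictly it is not a valid density; the check below exercises only the two defining equations.)

```python
# Check that A = -2, B = 6 satisfy the two conditions used above,
# via a simple midpoint-rule quadrature.
def integrate(f, a, b, n=10_000):
    h = (b - a) / n
    return sum(f(a + (i + 0.5) * h) for i in range(n)) * h

A, B = -2, 6
f = lambda x: A + B * x
total = integrate(f, 0, 1)                   # total "probability"
mean = integrate(lambda x: x * f(x), 0, 1)   # mean
print(round(total, 6), round(mean, 6))       # 1.0 1.0
```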

Example Problem-2:

We are given a probability density function (PDF):

f(x) = c·x²(1 − x), for 0 < x < 1

We are to find (i) the constant c, (ii) the mean, and (iii) the variance.


(i) The constant c

We know:

∫_0^1 f(x) dx = 1 ⇒ ∫_0^1 c·x²(1 − x) dx = 1

Factor out c:

c ∫_0^1 x²(1 − x) dx = 1

First, expand the integrand:

x²(1 − x) = x² − x³

Now integrate:

c ∫_0^1 (x² − x³) dx = c[x³/3 − x⁴/4]_0^1 = c(1/3 − 1/4) = c·(4 − 3)/12 = c/12

Set equal to 1:

c/12 = 1 ⇒ c = 12


(ii) Mean μ = E[X]

E[X] = ∫_0^1 x·f(x) dx = ∫_0^1 x·12x²(1 − x) dx = 12 ∫_0^1 x³(1 − x) dx

Expand the integrand:

x³(1 − x) = x³ − x⁴

Integrate:

12[x⁴/4 − x⁵/5]_0^1 = 12(1/4 − 1/5) = 12·(1/20) = 3/5


(iii) Variance σ² = E[X²] − (E[X])²

First compute E[X²]:

E[X²] = ∫_0^1 x²·f(x) dx = ∫_0^1 x²·12x²(1 − x) dx = 12 ∫_0^1 x⁴(1 − x) dx

= 12 ∫_0^1 (x⁴ − x⁵) dx = 12[x⁵/5 − x⁶/6]_0^1 = 12(1/5 − 1/6) = 12·(1/30) = 2/5

Now, use the formula:

Var(X) = E[X²] − (E[X])² = 2/5 − (3/5)² = 2/5 − 9/25 = (10 − 9)/25 = 1/25


✅ Final Answers:

  • (i) c = 12

  • (ii) Mean μ = 3/5

  • (iii) Variance σ² = 1/25
