As an Executive MBA student at Lagos Business School, I recently took a course on Data Analytics, where we covered two significant topics: Introduction to Probability and Discrete and Continuous Probability Distributions. In many cases, we can have poor intuition regarding the effects of probability. Here are some of my key learnings from the course:
Introduction to Probability: Probability is a fundamental concept in data analytics that helps us quantify the likelihood of events. It is the occurrence of a random event. It is defined as the ratio of the number of favorable outcomes to the total number of possible outcomes. Probability is expressed as a number between 0 and 1, where 0 indicates that an event is impossible, and 1 indicates that an event is certain. The analysis of events governed by probability is called statistics.
In data analytics, probability helps us analyze data and make informed decisions based on the likelihood of different outcomes. For instance, we can use probability to predict the probability of an event occurring, such as the likelihood of a customer purchasing a product or the likelihood of a machine breaking down.
Discrete Probability Distributions: A discrete probability distribution is a probability distribution that lists all the possible outcomes of a random variable and their corresponding probabilities. In the course, we learned about some of the most common discrete probability distributions such as the binomial distribution, Poisson distribution, and geometric distribution. The binomial distribution is used to model situations where there are only two possible outcomes, such as success or failure. The Poisson distribution is used to model situations where events occur randomly and independently, such as the number of customers visiting a store in an hour. The hypergeometric distribution is used to model situations where we are sampling from a finite population without replacement, such as drawing cards from a deck.
Continuous Probability Distributions: A continuous probability distribution is a probability distribution that describes the probability of a continuous random variable taking on a range of values. In the course, we learned about some of the most common continuous probability distributions such as the normal distribution, exponential distribution, and uniform distribution. The normal distribution is used to model many natural phenomena, such as human height and weight. The exponential distribution is used to model the probability of the time between two events occurring, and the uniform distribution is used to model the probability of a continuous random variable taking on a value within a certain range.
As an Executive MBA (EMBA) student, it’s essential to understand basic probability rules to make informed business decisions. Here’s a summary of the most important probability rules:
- Addition Rule: The probability of the occurrence of at least one of two or more events is equal to the sum of their individual probabilities minus the probability of their intersection.
- Multiplication Rule: The probability of the occurrence of two or more independent events is equal to the product of their individual probabilities.
- Conditional Probability: The probability of an event occurring, given that another event has already occurred.
- Bayes’ Theorem: A formula used to calculate conditional probabilities.
- Expected Value: The average outcome of a random event, weighted by its probability of occurrence.
In conclusion, the “Data Analytics” course provided me with a solid understanding of Introduction to Probability, Probability rules, and Discrete and Continuous Probability Distributions. These concepts are essential in data analytics and decision-making. The knowledge gained from this course will be useful in my career as an executive and in making informed business decisions. Understanding probability rules and probability distributions will enable me to analyze data effectively, identify patterns, and make predictions. I believe that the application of these concepts in business analytics will improve efficiency, optimize resources, and lead to better decision-making in organizations.