05-Statistics

Inferential statistics

Percentiles and Quartiles A percentile is a value below which a certain percentage of observation or data points lies. Percentiles show how a given value compares to others. The general rule is that if a value is in the kth percentile, it is greater than K per cent of the total values. Find percentile of n P = (n/N) * 100 n = Ordinal rank of values = No of values below n N = Total values in the dataset P% of the entire distribution falls below the value n [Read More]

04-Statistics

Inferential statistics

While descriptive statistics summarize the characteristics of a data set, inferential statistics help you come to conclusions and make predictions based on your data.

Inferential statistics have two main uses:

  • making estimates about populations (for example, the mean SAT score of all 11th graders in the US).
  • testing hypotheses to draw conclusions about populations (for example, the relationship between SAT scores and family income).
[Read More]

03-Statistics

Probability distributions

3.1 Probability Distribution Function

A probability distribution is a mathematical function that describes the probability of different possible outcomes for an experiment. Probability distributions are often depicted using graphs or probability tables.

Probability Distribution Function can be categorized into

  • Probability Density Function(PDF)
  • Probablity Mass Function(PMF)
  • Cumulative Distribution Function(CDF)
[Read More]

01-Statistics

Descriptive stats, Inferential stats

1.1 Statistics

  • Statistics is the science of collecting, organizing and analyzing the data.
  • Used for decision making process
  • Data - facts or pieces of information

1.2 Types of Statistics

  • Descriptive stats
    • It consists of organizing and summarizing the data.
  • Inferential stats
    • It consists of using data you have measured to form conclusion, make predictions.
    • By using sample-data make conclusion on population-data
[Read More]

Data Science RoadMap

Data Science

TL;DR

  • Data Acquisition
    • Excel, Statistics,Probability, SQL
  • Data Preparation
    • Python, Pandas, Numpy, Matplotlib/Seaborn
  • Exploratory data Analysis
    • Linear Algebra, Pandas
  • Data Modeling
    • Calculus, Machine learning, TensorFlow
  • Visualization
    • PowerBI, Tabelu
  • Deployment
    • Heroku, AWS
[Read More]

Flask

Flask

Flask

Flask is a popular Python web framework, meaning it is a third-party Python library used for developing web applications.

Flask is a lightweight WSGI web application framework. It is designed to make getting started quick and easy with the ability to scale up to complex applications.

Flask is based on the Werkzeug WSGI toolkit and Jinja2 template engine

[Read More]