Google News
logo
Statistics in Data Science - Quiz(MCQ)

Statistics in Data Science : A visual and mathematical portrayal of information is statistics. Data science is all about making calculations with data.

Statistics is divided broadly into two categories :

* Descriptive statistics : Provides ways to summarize data by turning unprocessed observations into understandable data that is simple to share.

* Inferential Statistics : With the help of inferential statistics, it is possible to analyze experiments with small samples of data and draw conclusions about the entire population (entire domain).

A)
2
B)
3
C)
4
D)
5

Correct Answer :   2


Explanation : Statistics is divided into two categories: descriptive statistics and inferential statistics.

* Descriptive statistics makes use of data to describe the population, either through numerical computations, graphs, or tables.

* Inferential Statistics draws conclusions and makes predictions about a population based on a sample of data drawn from that group.

A)
Statistics is used to process simple problems in the real world
B)
Statistics is used to process simple problems in the virtual world
C)
Statistics is used to process complex problems in the real world
D)
None of the above

Correct Answer :   Statistics is used to process complex problems in the real world


Explanation : Statistics is used to process complex problems in the real world so that Data Scientists and Analysts can look for meaningful trends and changes in Data.

A)
Data Science
B)
Data Scientist
C)
Machine Learning
D)
None of the above

Correct Answer :   Data Scientist


Explanation : * A data scientist is someone who is more skilled at statistics than any programmer and more skilled at programming than any statistician.

* Math and Statistics are vital for Data Science since they form the foundation of all Machine Learning Algorithms.

* Indeed, mathematics is everything around us, from forms, patterns, and colors to the number of petals in a flower.

* Mathematics is present in every part of our existence.

A)
predict the result
B)
to analyze raw data
C)
build a Statistical Model
D)
All of the above

Correct Answer :   All of the above


Explanation : Several Statistical functions, principles, and algorithms are implemented to analyze raw data, build a Statistical Model and infer or predict the result.

A)
Data Set
B)
Data Item
C)
Data Value
D)
Data variable

Correct Answer :   Data Item


Explanation : A Variable is any characteristics, number, or quantity that can be measured or counted. A variable may also be called a data item.

A)
2
B)
3
C)
4
D)
5

Correct Answer :   2


Explanation : An analysis of any event can be done in one of two ways: Quantitative Analysis and Qualitative Analysis.

A)
Math
B)
Statistics
C)
Both (A) and (B)
D)
None of the above

Correct Answer :   Both (A) and (B)


Explanation : Machine learning algorithms are built on the foundations of mathematics and statistics. To be a successful Data Scientist, we must first understand the fundamentals.

Math and statistics are the foundations of Machine Learning algorithms. It is critical to understand the techniques underlying various Machine Learning algorithms in order to determine when and how to use them.

A)
True
B)
False
C)
Can Not Say
D)
None of the above

Correct Answer :   True


Explanation : Machine Learning techniques and a data-driven approach are required to become a Data Scientist; however, Data Science is not limited to these domains. Math and statistics are important in Data Science because they may be utilized to develop Machine Learning models.

A)
Quantitative Analysis
B)
Qualitative Analysis
C)
Both (A) and (B)
D)
None of the above

Correct Answer :   Qualitative Analysis


Explanation : Qualitative or Non-Statistical Analysis gives generic information and uses text, sound and other forms of media to do so.

10 .
________ Statistics makes inferences and predictions about a population based on a sample of data taken from the population in question.
A)
Qualitative
B)
Descriptive
C)
Inferential
D)
Quantitative

Correct Answer :   Inferential


Explaination : Inferential Statistics makes inferences and predictions about a population based on a sample of data taken from the population in question.

A)
Analysis
B)
Data collection
C)
Interpretation and presentation
D)
All of the above

Correct Answer :   All of the above


Explanation : Statistics is a Mathematical Science that deals with the gathering, analysis, interpretation, and presentation of data.

Statistics is utilized in the real world to analyze complicated issues so that Data Scientists and Analysts may seek for relevant trends and changes in data.

Statistics, in a nutshell, may be used to gain useful insights from data by doing mathematical computations on it.

A)
Data science
B)
Inferential statistics
C)
Descriptive Statistics
D)
None of the above

Correct Answer :   Inferential statistics


Explanation : Inferential statistics generalizes a broad amount of data and employs probability to get a conclusion. It enables us to infer population parameters from sample statistics and develop models based on them.

We may use inferential statistics to construct predictions ("inferences") based on the data. We use inferential statistics to generate generalizations about a population based on data from samples.

13 .
________Statistics uses the data to provide descriptions of the population, either through numerical calculations or graphs or tables.
A)
Inferential
B)
Qualitative
C)
Descriptive
D)
Quantitative

Correct Answer :   Descriptive


Explaination : Descriptive Statistics uses the data to provide descriptions of the population, either through numerical calculations or graphs or tables.

A)
Mode
B)
Mean
C)
Median
D)
Standard Deviation

Correct Answer :   Mode


Explanation : The value most recurrent in the sample set is known as Mode.

A)
C
B)
C++
C)
R
D)
Ruby

Correct Answer :   R


Explanation : R language is commonly used with Statistics

A)
Inferential Statistics
B)
Descriptive Statistics
C)
Both (A) and (B)
D)
None of the above

Correct Answer :   Descriptive Statistics


Explanation : Descriptive statistics aids in data organization and focuses on data qualities by giving parameters.

Descriptive statistics are short descriptive coefficients that describe a particular data set, which might be a representation of the complete population or a sample of the population.

Measures of central tendency and measures of variability are two types of descriptive statistics (spread).

The mean, median, and mode are examples of measurements of central tendency, whereas standard deviation, variance, minimum and maximum variables, kurtosis, and skewness are examples of measures of variability.

A)
Mean
B)
Mode
C)
Both (A) and (B)
D)
Standard deviation

Correct Answer :   Standard deviation


Explanation : The standard deviation is just the square root of the variance, and it represents how much data deviates from its mean.

The standard deviation is frequently favored over the variance because it has the same unit as the data points, making it easier to read.

18 .
Variance describes how much a random variable differs from its expected value. It entails computing squares of deviations.
A)
True
B)
False
C)
Can not say
D)
None of the above

Correct Answer :   True


Explaination : The variance of a random variable defines how much it deviates from its predicted value. It requires calculating deviation squares.

The difference between each element and the mean is referred to as the deviation.

The population variance is the sum of the squared deviations. The sample variance is the average of the squared deviations from the mean.

A)
Data collection
B)
Data interpretation
C)
Identify patterns and trends
D)
All of the above

Correct Answer :   All of the above


Explanation : The discipline of gathering and analyzing data with numbers and graphs to detect patterns and trends is known as quantitative analysis or statistical analysis.

Quantitative analysis is a strategy for understanding behavior that employs mathematical and statistical modeling, measurement, and investigation.

A particular reality is represented numerically by quantitative analysts. Quantitative analysis is used to measure, evaluate, and value financial instruments, as well as anticipate real-world occurrences.

A)
Alternate Hypothesis
B)
Null Hypothesis
C)
Immediate Hypothesis
D)
All of the above

Correct Answer :   Alternate Hypothesis


Explanation : Alternate Hypothesis: Result disproves the assumption.

A)
0.01
B)
0.02
C)
0.03
D)
0.04

Correct Answer :   0.03


Explanation : If a bell curve is assumed, the probability of a "six sigma" event is on the order of one ten millionth of a percent.

A)
Mean
B)
Mode
C)
Median
D)
All of the above

Correct Answer :   All of the above


Explanation : The measurements of the centre are the mean, median, and mode. The average number obtained by summing all data points and dividing the total number of data points by the total number of data points.

The middle number is obtained by sorting all of the data points and selecting the one in the centre. The most common number—that is, the number that appears the most frequently.

A)
Data Mining
B)
Analyze raw data
C)
Data interpretation
D)
None of the above

Correct Answer :   Analyze raw data


Explanation : Several statistical functions, concepts, and algorithms are used to examine raw data, construct a statistical model, and infer or forecast the outcome.

Statistics has an impact on all aspects of life, including the stock market, life sciences, weather, retail, insurance, and education, to mention a few.

24 .
Central tendency measures like, mean, median, or measures of the spread, etc are used for statistical analysis.
A)
True
B)
False
C)
Can not say
D)
None of the above

Correct Answer :   True


Explaination : Data is represented graphically in the form of graphs such as histograms, line plots, and so on. The data is represented using some sort of central tendency. For statistical analysis, central tendency measurements such as mean, median, or measures of the spread, among others, are utilized.

A)
Testory
B)
Stautaory
C)
Chebyshev
D)
None of the above

Correct Answer :   Chebyshev


Explanation : Chebyshev's inequality is also spelled as Tchebysheff's inequality.

A)
Range
B)
Deviation
C)
Standard Deviation
D)
Inter Quartile Range

Correct Answer :   Inter Quartile Range


Explanation : Inter Quartile Range (IQR): It is the measure of variability, based on dividing a data set into quartiles.

A)
Convert a program from S to python
B)
Convert a program to decompress it
C)
Convert the program into a human readable document
D)
None of the above

Correct Answer :   Convert the program into a human readable document


Explanation : Literate Statistical Programming can be done with knitr.

A)
Combine explanatory text and data analysis code in a single document
B)
Require those data analysis summaries are always written in R
C)
Ensure that data analysis documents are always exported in JPEG format
D)
All of the above

Correct Answer :   Combine explanatory text and data analysis code in a single document


Explanation : Literate Statistical Practice is a programming methodology.

A)
Statistics In R is cross-platform compatible
B)
Statistics In R is a powerful scripting language
C)
Statistics In R is open-source and freely available
D)
All of the above

Correct Answer :   All of the above


Explanation : All is true regarding Statistics In R.

A)
eliette
B)
aleatory
C)
stochast
D)
None of the above

Correct Answer :   aleatory


Explanation : Random variable is also known as stochastic variable.

A)
HTML
B)
LaTeX
C)
Android
D)
RMarkdown

Correct Answer :   RMarkdown


Explanation : knitr is available on CRAN.

A)
There are three types of random variable
B)
Continuous random variable can take any value on the real line
C)
A random variable is a numerical outcome of an experiment
D)
All of the above

Correct Answer :   There are three types of random variable


Explanation : There are two types of random variable- continuous and discrete.

A)
No logical order
B)
Code is not automatic
C)
Slow processing of documents
D)
All of the above

Correct Answer :   Slow processing of documents


Explanation : Code and text is in one place.