Sampling in R Language: Important MCQs 10

The post is about Sampling in R Language. There are 20 multiple-choice questions from the sampling, ggplot2 package, and R language basics too. Let us start with the Quiz sampling in R Language.

R Language Online Quiz with Answers

1. Who introduced tidyverse to “share an underlying design philosophy, grammar, and data structures, of tidy data?

 
 
 
 

2. Which is the R command for selecting a sample of size 6 from the population yp <- c(111, 159, 121, 198, 120, 136, 14, 129, 17, 115, 186, 119, 121, 153, 143)

 
 
 
 

3. We obtain the 10000 random sample of size 6 under SRSWOR using the following population (111, 158, 122, 193, 111, 148, 112, 128, 113, 151, 185, 200, 199, 121, 115, 114) which is the R command for repeating this procedure 150 times?

 
 
 
 

4. A data analyst creates a scatterplot with a lot of data points. It is difficult for the analyst to distinguish the individual points on the plot because they overlap. What function could the analyst use to make the points easier to find?

 
 
 
 

5. Suppose, I have a vector x <- c(3, 6, 1, 19, 12, 8)and I want to set all elements of this vector that are less than 6 to be equal to zero. What R code achieves this?

 
 
 
 

6. A data analyst is working with a data frame named cars. The analyst notices that all the column names in the data frame are capitalized. What R code lets the analyst change all the column names to lowercase?

 
 
 
 

7. A data analyst is working with a data frame called “salary_data”. They want to create a new column named “total_wages” that adds together data in the “standard_wages” and “overtime_wages” columns. What R code lets the analyst create the “total_wages” column?

 
 
 
 

8. What is the class of the object defined by the expression x <- c(4, "a", TRUE)?

 
 
 
 

9. For the following population y<- c(1,2,3,4,5), what will be the R command for finding variance?

 
 
 
 

10. For the following population x<-c(1,2,3,4,5)what will be the R command for finding mean?

 
 
 
 

11. In ggplot2, what function do you use to map variables in your data to visual features of your plot?

 
 
 
 

12. What is the R command for generating bi-variate normal distribution

 
 
 
 

13. In ggplot2, which of the following concepts refers to the shape, color, and size of data points in a plot?

 
 
 
 

14. A data analyst creates a plot for visualization. The analyst wants to add a caption to the plot to help communicate important information. What function could the analyst use?

 
 
 
 

15. A data analyst is considering using tibbles instead of basic data frames. What are some of the limitations of tibbles?

 
 
 
 

16. A data analyst is working with a data frame named “stores”. It has separate columns for city (“city”) and state (“state”). The analyst wants to combine the two columns into a single column named “location”, with the “city” and “state” separated by a comma. What R code lets the analyst create the location column?

 
 
 
 

17. Which of the following functions lets you display smaller groups, or subsets, of your data?

 
 
 
 

18. What is the R command for selecting sample of size $n=10$ by probability proportional to size (PPS) with $N=40$.

 
 
 
 

19. Which of these dplyr verbs can be used to retrieve columns of a dataset?

 
 
 
 

20. Which is the R command for selecting a sample of size 3 from the population y<- c(11, 150, 121, 192, 233, 129, 117, 186, 129, 189, 159)

 
 
 
 

The dplyr package is used for data manipulation and transformation. It gives a set of functions that make it easy to perform common data manipulation tasks, which include (1) filtering, (2) grouping, (3) summarizing, (4) arranging, and (5) joining data frames.

The package is part of the tidyverse, a collection of R packages designed to work together seamlessly for data analysis and visualization.

Some key functions available in dplyr R Package include:

  • filter(): Used to subset rows based on specified conditions.
  • select(): Used to choose specific columns from a data frame.
  • arrange(): Used to reorder rows based on one or more columns.
  • mutate(): Used to create new columns or modify existing ones.
  • group_by(): Used to group data by one or more variables.
  • summarize(): Used to compute summary statistics for groups of data.
  • join(): Used to merge data frames based on common keys.

The dplyr package provides a powerful and efficient toolkit for data manipulation in R.

R FAQS Logo: Quiz Sampling in R Language

Quiz Sampling in R Language

  • What is the class of the object defined by the expression x <- c(4, “a”, TRUE)?
  • Suppose, I have a vector x <- c(3, 6, 1, 19, 12, 8)and I want to set all elements of this vector that are less than 6 to be equal to zero. What R code achieves this?
  • We obtain the 10000 random sample of size 6 under SRSWOR using the following population (111, 158, 122, 193, 111, 148, 112, 128, 113, 151, 185, 200, 199, 121, 115, 114) which is the R command for repeating this procedure 150 times?
  • Which is the R command for selecting a sample of size 6 from the population yp <- c(111, 159, 121, 198, 120, 136, 14, 129, 17, 115, 186, 119, 121, 153, 143)
  • For the following population y<- c(1,2,3,4,5), what will be the R command for finding variance?
  • For the following population x<-c(1,2,3,4,5)what will be the R command for finding the mean?
  • Which is the R command for selecting a sample of size 3 from the population y<- c(11, 150, 121, 192, 233, 129, 117, 186, 129, 189, 159)
  • What is the R command for generating bi-variate normal distribution
  • What is the R command for selecting a sample of size $n=10$ by probability proportional to size (PPS) with $N=40$?
  • Which of these dplyr verbs can be used to retrieve columns of a dataset?
  • Who introduced tidyverse to “share an underlying design philosophy, grammar, and data structures, of tidy data?
  • A data analyst is working with a data frame called “salary_data”. They want to create a new column named “total_wages” that adds together data in the “standard_wages” and “overtime_wages” columns. What R code lets the analyst create the “total_wages” column?
  • A data analyst is working with a data frame named “stores”. It has separate columns for city (“city”) and state (“state”). The analyst wants to combine the two columns into a single column named “location”, with the “city” and “state” separated by a comma. What R code lets the analyst create the location column?
  • A data analyst is considering using tibbles instead of basic data frames. What are some of the limitations of tibbles?
  • A data analyst is working with a data frame named cars. The analyst notices that all the column names in the data frame are capitalized. What R code lets the analyst change all the column names to lowercase?
  • In ggplot2, what function do you use to map variables in your data to visual features of your plot?
  • In ggplot2, which of the following concepts refers to the shape, color, and size of data points in a plot?
  • Which of the following functions lets you display smaller groups, or subsets, of your data?
  • A data analyst creates a scatterplot with a lot of data points. It is difficult for the analyst to distinguish the individual points on the plot because they overlap. What function could the analyst use to make the points easier to find?
  • A data analyst creates a plot for visualization. The analyst wants to add a caption to the plot to help communicate important information. What function could the analyst use?

Computer MCQs Test Online

SPSS Data Analysis

Questions about R: Important Frequently Asked

This post is about some frequently asked Questions about R Language. The frequently asked questions are about compilers in R, R packages, just in just-in-time compilers, procedural programming in R, and the Recycling rule of vectors. These questions will help you prepare for examinations and interviews.

Frequently Asked Questions About R

Questions about R Language

Question: What is a Compiler in R Language?
Answer: A compiler is software that transforms computer code (source code) to another computer language (target language, i.e., object code).

Question: What is a package in R Language?
Answer: The R package is a collection of R functions, compiled code, sample data, and help documentation. The R packages are stored in a directory called “library” in the R environment. The R language also installed a set of packages during installation.

Question: What is JIT?
Answer:
JIT standards for “Just in Time” compiler. It is a method to improve the run-time performance of a computer program.

Question: What is procedural Programming in R Language?
Answer:
Procedural programming is derived from structured programming and it is based on the concept of procedure call. Procedures are also known as routines, subroutines, or functions. It contains a series of computational steps to be carried out. Any procedure may be called (at any point) during a program’s execution.

Mathematical Operation in R

Question: What is the recycling of elements in a vector?
Answer: When a mathematical operation (such as addition, subtraction, multiplication, division, etc) is performed on two vectors of different lengths (the number of elements in both vectors is different), the element having a shorter length is reused to complete the mathematical operations.

vect1 <- c(4, 1, 4, 5, 6, 9)
vect2 <- c(2, 5)
vect1 * vect2 

###
8, 5, 8, 25, 12, 45

The elements of vect2 are recycled to complete the operation of all elements of vect1.

Question: What is the difference between a data frame and a matrix in R Language?
Answer: In R, the data frame contains heterogeneous data (different columns of the data frame may have different types of variable) while a matrix contains homogeneous data (all the columns of the matrix have the same type of variable). In a matrix, similar data types can be stored while in a data frame, different types of data can be stored.

See Questions about R language Missing Values

MCQs General Knowledge, MCQs in Statistics

R Language: A Quick Reference Guide – IV

R Quick Reference Guide

Quick Reference Quide R Language

R language: A Quick Reference Guide about learning R Programming with a short description of the widely used commands. It will help the learner and intermediate user of the R Programming Language to get help with different functions quickly. This Quick Reference is classified into different groups. Let us start with R Language: A Quick Reference – IV.

This Quick Reference will help in performing different descriptive statistics on vectors, matrices, lists, data frames, arrays, and factors.

Basic Descriptive Statistics in R Language

The following is the list of widely used functions that are further helpful in computing descriptive statistics. The functions below are not direct descriptive statistics functions, however, these functions are helpful to compute other descriptive statistics.

R CommandShort Description
sum(x1, x2, … , xn)Computes the sum/total of $n$ numeric values given as argument
prod(x1, x2, … , xn)Computes the product of all $n$ numeric values given as argument
min(x1, x2, … , xn)Gives smallest of all $n$ values given as argument
max(x1, x2, …, xn)Gives largest of all $n$ values given as argument
range(x1, x2, … , xn)Gives both the smallest and largest of all $n$ values given as argument
pmin(x1, x2, …)Returns minima of the input values
pmax(x1, x2, …)Returns maxima of the input values

Statistical Descriptive Statistics in R Language

The following functions are used to compute measures of central tendency, measures of dispersion, and measures of positions.

R CommandShort Description
mean(x)Computes the arithmetic mean of all elements in $x$
sd(x)Computes the standard deviation of all elements in $x$
var(x)Computes the variance of all elements in $x$
median(x)Computes the median of all elements in $x$
quantile(x)Computes the median, quartiles, and extremes in $x$
quantile(x, p)Computes the quantiles specified by $p$

Cumulative Summaries in R Language

The following functions are also helpful in computing the other descriptive calculations.

R CommandShort Description
cumsum(x)Computes the cumulative sum of $x$
cumprod(x)Computes the cumulative product of $x$
cummin(x)Computes the cumulative minimum of $x$
cummax(x)Computes the cumulative maximum of $x$

Sorting and Ordering Elements in R Language

The sorting and ordering functions are useful in especially non-parametric methods.

R CommandShort Description
sort(x)Sort the all elements of $x$ in ascending order
sort(x, decreasing = TRUE)Sor the all elements of $x$ in descending order
rev(x)Reverse the elements in $x$
order(x)Get the ordering permutation of $x$

Sequence and Repetition of Elements in R Language

These functions are used to generate a sequence of numbers or repeat the set of numbers $n$ times.

R CommandShort Description
a:bGenerates a sequence of numbers from $a$ to $b$ in steps of size 1
seq(n)Generates a sequence of numbers from 1 to $n$
seq(a, b)Generates a sequence of numbers from $a$ to $b$ in steps of size 1, it is the same as a:b
seq(a, b, by=s)Generates a sequence of numbers from $a$ to $b$ in steps of size $s$.
seq(a, b, length=n)Generates a sequence of numbers having length $n$ from $a$ to $b$
rep(x, n)Repeats the elements $n$ times
rep(x, each=n)Repeats the elements of $x$, each element is repeated $n$ times
R Quick Reference Guide Frequently Asked Questions About R

R Language: A Quick Reference – I

https://gmstat.com