Muhammad Imdad Ullah, Author at R Programming FAQs

Use of Important Functions in R

July 29, 2025 by Muhammad Imdad Ullah

Looking for the most important functions in R? This blog post answers key questions like creating frequency tables (table()), redirecting output (sink()), transposing data, calculating standard deviation, performing t-tests, ANOVA, and more. Perfect for R beginners and data analysts!

Important functions in R
R programming cheat sheet
Frequency table in R (table())
How to use sink() in R
Transpose data in R (t())
Standard deviation in R (sd())
T-test, ANOVA, and Shapiro-Wilk test in R
Correlation and covariance in R
Scatterplot matrices (pairs())
Diagnostic plots in R

This Important functions in R, Q&A-style guide covers essential R functions with clear examples, helping you master data manipulation, statistical tests, and visualization in R. Whether you’re a beginner or an intermediate user, this post will strengthen your R programming skills!

Which function is used to create a frequency table in R?

In R, a frequency table can be created by using table() function.

What is the use of `sink()` function?

The sink() function in R is used to redirect R output (such as the results of computations, printed messages, or console output) to a file instead of displaying it in the console. This is particularly useful for saving logs, results of analyses, or any other text output generated by R scripts.

Explain what transpose is and how it is performed.

Transpose is used for reshaping the data, which is used for analysis. Transpose is performed by t() function.

What is the length function in R?

The length() function in R gets or sets the length of a vector (list) or other objects. The length() function can be used for all R objects. For an environment, it returns the object number in it. NULL returns 0.

What is the difference between `seq(4)` and `seq_along(4)`?

seq(4) means vector from 1 to 4 (c(1,2,3,4)) whereas seq_along(4) means a vector of the length(4) or 1 (c(1)).

Vector $v$ is `c(1,2,3,4)` and list $x$ is `list(5:8)`. What is the output of `v*x[[1]]`?

[1] 5 12 21 32s

How do you get the standard deviation for a vector $x$?

sd(x, na.rm=TRUE)

$X$ is the vector `c(5,9.2,3,8.51,NA)`. What is the output of `mean(x)`?

The output will be NA.

How can one compute correlation and covariance in R?

Correlation is produced by cor() and covariance is produced by cov() function.

How to create scatterplot matrices?

pair() or splom() function are used to create scatterplot matrices.

What is the use of diagnostic plots?

It is used to check the normality, heteroscedasticity, and influential observations.

What is `principal()` function?

It is defined in the psych package that is used to rotate and extract the principal components.

Define `mshapiro.test()`?

It is a function which defined in the mvnormtest package. It produces the Shapiro-Wilk test for multivariate normality.

Define `barlett.test()`.

The barlett.test() is used to provide a parametric k-sample test of the equality of variances.

Define `anova()` function.

The anova() is used to compare the nested models. Read more One-Way ANOVA

Define `plotmeans()`.

It is defined under the gplots package, which includes confidence intervals, and it produces a mean plot for single factors.

Define `loglm()` function.

The loglm() function is used to create log-linear models.

What is `t-tests()` in R?

We use it to determine whether the means of two groups are equal or not by using t.test() function.

Statistics and Data Analysis

Matplotlib Python Quiz 14

July 23, 2025 by Muhammad Imdad Ullah

Test your Matplotlib skills with this 20-question Matplotlib Python Quiz! Perfect for students, data analysts, and data scientists preparing for job interviews. Test your knowledge about line plots, histograms, subplots, labels, legends, and more. Includes answers to help you master data visualization in Python! Let us start with the Online matplotlib Python Quiz now.

Online Matplotlib Python Quiz with Answers

What is Matplotlib primarily used for?
Which function is used to create a line plot in Matplotlib?
How do you display a Matplotlib plot in a Jupyter Notebook?
Which command is used to add a title to a plot?
What does plt.xlabel() do?
Which function is used to create a histogram?
What is the correct way to save a Matplotlib plot as a PNG file?
Which of the following is used to add a legend to a plot?
What does plt.subplots() return?
Which parameter controls the transparency of a plot element?
How do you create a bar plot in Matplotlib?
What is the purpose of plt.grid()?
Which Matplotlib backend is used for interactive plots in Jupyter Notebook?
How do you set the figure size in Matplotlib?
Which of the following steps are necessary to customize the kind parameter to render a bar or horizontal bar graph in Matplotlib?
When using Matplotlib, how can you access preconfigured color templates?
Which command is used to add a title to a plot?
How do you label the x-axis in Matplotlib?
What does plt.subplot(2, 2, 3) mean?
How do you set the x-axis limits in a plot?

Basic Statistics MCQs Test

Files in R Language

July 18, 2025 by Muhammad Imdad Ullah

Learn everything about files in R, including .RData, CSV, Excel, and text files. Discover how to read, write, and restore R objects using load(), save(), read.csv(), and more. Explore best practices for file handling in R and compare different file formats for efficient data management. Perfect for R programmers, data analysts, and researchers working with datasets in R.

What is a File in the R Language?

In R, a file refers to data stored on a computer storage device. The script written in R has an extension *.R that can read into R or write from R. R Files are essential for importing external data, saving results, and sharing work. The R script files contain code that can be executed within the R software environment.

Describe commonly used Files in R

For illustration purposes, I have categorized the commonly used files in R as code files, data files, and specialized data files.

Code Files:

.R (R script files)
.Rmd (R Markdown files)

Data Files:

.csv (Comma Separated Values) – Most common for tabular data
.txt (Plain text files)
.xlsx or .xls (Excel files)
.RData or .rda (R’s native binary format)

Specialized Data Formats:

.json (for structured data)
.xml (for hierarchical data)
.sav (SPSS files)
.dta (Stata files)

What are the best Practices for using Files in R?

Use relative paths when possible for portability
Check file existence before reading
Close connections (when the database connection is open) after reading/writing certain file types
Consider using the package here for more reliable file paths

What is .RData Files in R

An .RData (or .rda) file is a binary file format used by R. It is used to save multiple objects (variables, data frames, functions, etc.) in a compressed, space-efficient way. It is R’s native format for storing workspace data.

What are the Key Features of `.RData` Files?

The key features of .RData files in R are:

Stores Multiple Objects
- The ..RData can save several R objects (e.g., data frames, lists, models) in a single file.
- Example: save(df, model, list1, file = "mydata.RData")
Binary Format (Not Human-Readable)
- Unlike .csv or .txt, .RData files are not plain text and cannot be opened in a text editor.
Compressed by Default
- Uses compression to reduce file size (especially useful for large datasets).
Platform-Independent
- Can be shared across different operating systems (Windows, macOS, Linux).
Preserves Attributes
- Keeps metadata (e.g., variable labels, factors, custom classes).

Which command is used for restoring an R object from a file?

In R, one can restore the saved objects from a file using the load() function. The load() command loads all objects stored in the file into the current R environment. This command works with .RData or .rda files (these are binary files used by R). This command does not work with .csv, .txt, or xlsx, etc. files.

Explain the use of `load()` command with example.

The following example first creates objects $x$, $y$, and $z$. These objects will be saved in “my_work.RData” file. These objects will appear in the R workspace after loading.

x <- rnorm(10)
y <- 1:20
z <- "Level of Significance"

save(x, y, z, file = "my_work.RData")
load("my_work.RData")

How many ways are there to read and write files in R?

There are dozens of ways to read and write files in R. The best approach depends on the file type and size. Depending on the file format and the packages used, the following is a categorized breakdown of the most common methods:

Base R Functions

Reading Files
- read.table(): Generic function to read tabular data (e.g., .txt).
- read.csv(): For comma-separated values (CSV) files.
- read.delim(): For tab-delimited files (.tsv or .txt).
- scan(): Low-level function to read raw data.
- load(): Restores R objects from .RData or .rda files.
- readRDS(): Reads a single R object from .rds files.
Writing Files
- write.table(): Writes data frames to text files.
- write.csv(): Writes to CSV files.
- write.delim(): Writes tab-delimited files.
- save(): Saves multiple R objects to .RData or .rda.
- saveRDS(): Saves a single R object to .rds.

Using Packages

Reading Files

Package	Function	File Type Supported
`readr`	`read_csv()`	Faster CSV reading
`readxl`	`read_excel()`	Excel (`.xlsx`, `.xls`)
`data.table`	`fread()`	Fast CSV/TSV import
`haven`	`read_spss()`	SPSS (`.sav`)
`haven`	`read_stata()`	Stata (`.dta`)
`jsonlite`	`fromJSON()`	JSON files
`xml2`	`read_xml()`	XML files

Writing Files

Package	Function	File Type Supported
`readr`	`write_csv()`	Faster CSV export
`writexl`	`write_xlsx()`	Excel (`.xlsx`)
`data.table`	`fwrite()`	Fast CSV/TSV export
`haven`	`write_sav()`	SPSS (`.sav`)
`haven`	`write_dta()`	Stata (`.dta`)
`jsonlite`	`toJSON()`	JSON files
`xml2`	`write_xml()`	XML files

Specialized Methods

For Large Datasets

vroom (from the vroom package) – High-speed reading of large CSV/TSV files.
arrow (Apache Arrow) – Efficient for big data (supports Parquet, Feather formats).

For Databases

DBI + RSQLite/RMySQL/odbc: Read/write from SQL databases.

For Binary & Custom Formats

feather: Fast binary storage (works well with Python).
qs: A faster alternative to saveRDS() for large objects.

Statistics and Data Analysis

Use of Important Functions in R

Table of Contents