R

CK Cafe: Using Association Rules to Find Basket of Goods

In this lab session, I share how to use apriori algorithm for association mining. The goal is to find useful causal and association rules which can help in designing promotions for the company. Plus, you get to see what’s served at an Indian cafe.

Next — Today I learnt About R

What is Next? A short and sweet curated collection of R-related works. Five stories. Four packages. Three jargons. Two tweets. One Meme. You can subscribe by providing your details here. Promise, no spams. Email address First name (Optional) Last name (Optional) By subscribing, you agree with Revue’s Terms of Service and Privacy Policy. If you are unsure, here are some editions that my readers loved.

Street Maps (of Some Cities)

Over the last few days, I dabbled with maps in R. Two days ago, I made a map of all the cities I’ve visited. Today, I thought to make street maps of some of them (and other cool cities).

Billionaires and Taxes

How much do the super rich really pay? Until now, the answer was unknown — thanks to privacy laws. Recently, tax returns of the super rich was leaked to ProPublica. I thought of exploring the billionaires and their tax rates.

I Web, Therefore I Exist

Slides, recorded lecture and additional resources around my talk on how to create and control your digital identity.

IndiaPIN: R Data Package

R Package for All India PIN Codes Directory with Latitude and Longitude Details (Updated: December 2021)

Is COVID-19 Data tampered?

Is there any evidence of tampering or manipulation in COVID-19 daily cases reported by countries? Using Benford analysis in R, I try to reach at some conclusion.

Dynamic GP: Application to Malaria Vaccine Coverage Prediction

We applied a dynamic Gaussian process model to predict coverage for novel Malaria vaccines in 78 countries. Using publicly available WHO data on coverage of nine vaccines, we developed localised models for countries grouped using the human development index (HDI). We deployed convolutions of standard GP models with weights determined using singular value decomposition of time-series response matrix.

Statistical Modelling and Analysis of the Computer-Simulated Datasets

My first academic publication: a peer-reviewed book chapter on statistical modelling using Gaussian processes. We reviewed several GP models and correlation structures, and methods to handle numerical instabilities due to near-singular matrices. Finally, we reviewed several algorithms developed specifically for analysing big data obtained from computer simulators.