by Bruno Rodrigues (guest)

Analysis of the Renert - Part 1: Scraping

In this series of blog posts, I show how you can scrape text from the internet and use it to perform a tidy text analysis. I analyze a Luxembourgish fable called Renert.

Read more → This is part 1 of a 3 part blog post. This post presents the Luxembourgish language as well as the literary work I am going to analyze using the R programming language. Part 2 deals with preparing the data for analysis, and finally part 3 is the analysis. Hope you enjoy! Luxembourg and the Luxembourgish language Luxembourg is a small European country, squeezed between France, Belgium and Germany. Over the course of its history, it’s been invaded over and over by either France or Prussia (later Germany).

by Kevin Rosamont

Churn Analysis - Part 2: Indicators

Find the best threshold value to optimize the different indicators and then, measuring the model performance with a lift curve.

Read more → pre code, pre, code { white-space: pre !important; overflow-x: scroll !important; overflow-y: scroll !important; word-break: keep-all !important; word-wrap: initial !important; max-height:25vh !important; } p img{ width:100%; !important; } .plotly.html-widget.html-widget-static-bound.js-plotly-plot{ width:100%; !important; } .svg-container{ width:100%; !important; } .main-svg{ width:100%; !important; } -- Hello everyone, In the last post we have decided to continue our study with the logistic regression. We have obtained the following ROC curve with an area under the curve (AUC) of 0.

Identifying customers that intent to leave their current telecom provider using decision tree, random forest and logistic regression models.

Read more → pre code, pre, code { white-space: pre !important; overflow-x: scroll !important; overflow-y: scroll !important; word-break: keep-all !important; word-wrap: initial !important; max-height:30vh !important; } p img{ width:100%; !important; } -- Hello everyone, Today we will make a churn analysis with a dataset provided by IBM. You can find the dataset here. What is a churn? We can shortly define customer churn (most commonly called “churn”) as customers that stop doing business with a company or a service.

by Bruno Rodrigues (guest)

Map unemployment using R with ggplot2

If you want to know how to create maps in R without too much efforts. You are reading the good blog post! The package ggplot2 offers strongs possibilities and creating maps is one of them.

Read more → In this blog post, I show various ways to create maps using R. You’ll need to install a lot of packages and download two data sets; the unemployment rate in Luxembourg as well as a shapefile. To get the unemployment rate in Luxembourg, you can take a look at our previous blog post or simply run the following lines: library(rvest) library(dplyr) library(purrr) library(janitor) library(tidyr) page_unemp = read_html("http://www.statistiques.public.lu/stat/TableViewer/tableViewHTML.aspx?ReportId=12950&IF_Language=eng&MainTheme=2&FldrName=3&RFPath=91") data_raw = page_unemp %>% html_nodes(".