Monthly Archives: March 2018

Regularization: Lasso vs Ridge

Regularization: a process of adding a tuning paramater to a model to induce the smoothness of the weights (prevent the coefficients to fit so perfectly) in order to prevent overfitting. It’s most often done by adding a constant multiple to an existing weight vector, the constant can be any norm (sometimes it will be Lasso […]

Read More

Data scientist’s toolbox

Here’s a list of statistical and machine learning concepts that are in every data scientist’s toolbox. Tree based methods Some of the most universally useful methods in data science are decision tree based: decision trees, random forests and gradient boosted trees. Decision trees as base learners have a lot of very useful characteristics most of […]

Read More

Feature Selection Techniques

Variable selection is used to identify a subset of the available inputs that accurately predict and output. Why need variable selection?  With a smaller dataset, you can reduce the cost for data collection/cleaning, improve speed/performance by decreasing computation time and scoring efforts. And also improve interchangeability, remove multicollinearity & irralational coefficients, treat missing data, reduce […]

Read More

Model Evaluation

When we talk about predictive models, we are talking either about a regression model (continuous output) or a classification model (nominal or binary output). The evaluation metrics used in each of these models are different. In classification problems, we use two types of algorithms (dependent on the kind of output it creates): Class output : […]

Read More

Marcus by Goldman Sachs: Digital Finance

Goldman Sachs was recently honored by Fast Company as one of the most innovative companies in finance. Head of Digital Finance Harit Talwar discusses the culture of innovation behind Marcus, the firm’s first consumer digital lending platform, and what it’s been like building a start-up within a nearly 150-year-old company. On innovation, Harit notes: “The […]

Read More

Personal Loan: Too Many Big or Small Players

Personal Loans: Estimated offers for $10,000 Debt consolidation loans for borrowers with good credit You can use an unsecured personal loan to consolidate debt or finance large purchases. Interest rates and terms can vary, based on your credit score and other factors. Compare loans from multiple lenders and learn more about personal loans. Competition heats […]

Read More

The Profitability of the Average Checking Account

Maintaining a customer’s checking account costs your financial institution money. The American Bankers Association estimates the annual cost to a bank to maintain a checking account is between $250 and $400 per year. For community financial institutions with less than $5 billion in assets, the average according to other researchers is closer to $250 to […]

Read More

Mobile Analytics

WHY ARE MOBILE ANALYTICS IMPORTANT? Mobile usage surpassed that of desktop in 2015 and is fast becoming consumers’ preferred portal to the internet. Always-on consumers spend 70 percent of their media consumption and screen time on mobile devices, and most of that time in smartphone apps. This is a tremendous opportunity for companies to reach their […]

Read More

Storytelling with Data

https://www.analyticsvidhya.com/blog/2017/10/art-story-telling-data-science/ Introduction The idea of storytelling is fascinating; to take an idea or an incident, and turn it into a story. It brings the idea to life and makes it  more interesting. This happens in our day to day life. Whether we narrate a funny incident or our findings, stories have always been the “go-to” […]

Read More

109 Commonly Asked Data Science Interview Questions

https://www.springboard.com/blog/data-science-interview-questions/         Table of Contents Statistics Programming General Big Data Python R SQL Modeling Behavioral Culture Fit Problem-Solving 1. Statistics Statistical computing is the process through which data scientists take raw data and create predictions and models backed by the data. Without an advanced knowledge of statistics it is difficult to succeed […]

Read More