Welcome!

David Smith

Subscribe to David Smith: eMailAlertsEmail Alerts
Get David Smith via: homepageHomepage mobileMobile rssRSS facebookFacebook twitterTwitter linkedinLinkedIn


Top Stories by David Smith

by Norman Matloff The American Statistical Association (ASA) leadership, and many in Statistics academia. have been undergoing a period of angst the last few years, They worry that the field of Statistics is headed for a future of reduced national influence and importance, with the feeling that: The field is to a large extent being usurped by other disciplines, notably Computer Science (CS). Efforts to make the field attractive to students have largely been unsuccessful. I had been aware of these issues for quite a while, and thus was pleasantly surprised last year to see then-ASA president Marie Davidson write a plaintive editorial titled, “Aren’t We Data Science?” Good, the ASA is taking action, I thought. But even then I was startled to learn during JSM 2014 (a conference tellingly titled “Statistics: Global Impact, Past, Present and Future”) that the ASA leade... (more)

Integrating R with production systems using an HTTP API

by Nick Elprin, Co-Founder of Domino Data Lab We built a platform that lets analysts deploy R code to an HTTP server with one click, and we describe it in detail below.  If you have ever wanted to invoke your R model with a simple HTTP call, without dealing with any infrastructure setup or asking for help from developers — imagine Heroku for your R code — we hope you’ll enjoy this. Introduction Across industries, analytical models are powering core business processes and applications as more companies realize that that analytics are key to their competitiveness.  R is particularl... (more)

A Data Scientist's and R User's Guide to the JSM

by Joseph Rickert The Joint Statistical Meetings (JSM) get underway this weekend in Boston and Revolution Analytics is again proud to be a sponsor. More than 6,000 statisticians and data scientists from around the world are expected to attend and listen to thousands of presentations. It is true that many talks will be on specialized topics that only statisticians working in particular a field will have the interest and patience to sit through. However, there is evidence that the conference will have something exciting to offer data scientists and statisticians working in industry... (more)

Revisiting package dependencies

by Andrie de Vries In my previous post I wrote about how to identify and visualize package dependencies.  Within hours, Duncan Murdoch (member of R-core) identified some discrepancies between my list of dependencies and the visualisation.  Since then, I fixed the dispecrancies. In this blog post I attempt to clarify the issues involved in listing package dependencies. In miniCRAN I expose two functions that provides information about dependencies: The function pkgDep() returns a character vector with the names of dependencies. Internally, pkgDep() is a wrapper around tools::pac... (more)

Because it's Friday: BuzzFeed for Data Scientists

There's a new online lifestyle magazine for data scientists with a machine-learning bent: ML Daily. (Thanks to reader SG for the tip.) Check it out for lots of useful articles, including: Shockingly Effective Trick To Maximize your Margin!!! (Learn any problem with just 10 training samples!) How Kim Kardsashian keeps of the pounds with the Kardashian Kernel!! Five signs you'll overfit and die! A spectral approach to ghost detection: fitting a paranormal distribution Cat Basis Purrsuit -- feline-related work for the machine learning and computer vision communities! And much much m... (more)