Tony Truong
  • Home
Subscribe
Tagged

mapreduce

A collection of 3 posts

Scala

Running An Apache Spark Application on Amazon Elastic MapReduce

This is a series of guided screenshots on how to run an AWS EMR Spark application. Last time we wrote a spark count application that found the list of channels with more than 24 hours of programming. We will run that same application this time on EMR instead of the

Tony Truong Nov 25, 2015 • 3 min read
spark

Running A Count With MapReduce in Apache Spark

Apache Spark Snippet - Counts# This is the first in a series of snippets on Apache Spark programs. In a previous post I ran a machine learning algorithm through Spark and will be following a similar setup using the Hortonworks Sandbox. In the future I'll do some snippets on AWS'

Tony Truong Nov 12, 2015 • 3 min read
spark

Predicting Movie Ratings with Apache Spark, and Hortonworks

Today's goal is to make a prediction on a movie's rating based on its synopsis using machine learning in an environment that could scale out to hundreds or even thousands of nodes. As the title suggests, I'll be doing it on Apache Spark using MLlib written in Scala. I wanted

Tony Truong May 11, 2015 • 7 min read
Tony Truong © 2022
Powered by Ghost