Parallel R: Data Analysis in the Distributed World (1)
By: and
Sign Up Now!
Already a Member? Log In
You must be logged into Bookshare to access this title.
Learn about membership options,
or view our freely available titles.
- Synopsis
- It’s tough to argue with R as a high-quality, cross-platform, open source statistical software product—unless you’re in the business of crunching Big Data. This concise book introduces you to several strategies for using R to analyze large datasets, including three chapters on using R and Hadoop together. You’ll learn the basics of Snow, Multicore, Parallel, Segue, RHIPE, and Hadoop Streaming, including how to find them, how to use them, when they work well, and when they don’t.With these packages, you can overcome R’s single-threaded nature by spreading work across multiple CPUs, or offloading work to multiple machines to address R’s memory barrier.Snow: works well in a traditional cluster environmentMulticore: popular for multiprocessor and multicore computersParallel: part of the upcoming R 2.14.0 releaseR+Hadoop: provides low-level access to a popular form of cluster computingRHIPE: uses Hadoop’s power with R’s language and interactive shellSegue: lets you use Elastic MapReduce as a backend for lapply-style operations
- Copyright:
- 2011
Book Details
- Book Quality:
- Publisher Quality
- Book Size:
- 126 Pages
- ISBN-13:
- 9781449320331
- Related ISBNs:
- 9781449320348, 9781449320324, 9781449309923
- Publisher:
- O'Reilly Media
- Date of Addition:
- 02/06/25
- Copyrighted By:
- Q. Ethan McCallum and Stephen Weston
- Adult content:
- No
- Language:
- English
- Has Image Descriptions:
- No
- Categories:
- Nonfiction, Computers and Internet, Mathematics and Statistics
- Submitted By:
- Bookshare Staff
- Usage Restrictions:
- This is a copyrighted book.
Reviews
Other Books
- by Q. Ethan McCallum
- by Stephen Weston
- in Nonfiction
- in Computers and Internet
- in Mathematics and Statistics