|
Book details / order |
PARALLEL R: DATA ANALYSIS IN THE DISTRIBUTED WORLD |
It’stough to argue with r as a high-quality, cross-platform, open sourcestatistical software product—unless you’re in the business of crunching bigdata. this concise book introduces you to several strategies for using r toanalyze large datasets. you’ll learn the basics of snow, multicore, parallel,and some hadoop-related tools, including how to find them, how to use them,when they work well, and when they don’t.
withthese packages, you can overcome r’s single-threaded nature by spreading workacross multiple cpus, or offloading work to multiple machines to address r’smemory barrier.
snow: works well in a traditional cluster environment
multicore: popular for multiprocessor and multicore computers
parallel: part of the upcoming r 2.14.0 release
r+hadoop: provides low-level access to a popular form of cluster computing
rhipe: uses hadoop’s power with r’s language and interactive shell
segue: lets you use elastic mapreduce as a backend for lapply-style operations
Author : Stephen weston, q ethan mccallum
Publication : Oreilly
Isbn : 9789350236802
Store book number : 105
NRS 440.00
|
|
|
|
|
|
|
|
|
|