Spark good readings | Open Knowledge Base

Open Knowledge Base

Home
Hadoop
MapR
Apache Hive
Apache Drill
Apache Spark
Cloudera Impala
JAVA
OS
[Follow ME]

Monday, July 21, 2014

Spark good readings

1. Spark and Shark(Slideshare)
2. Apache Spark(Slideshare)
3. Lightening fast big data analytics using apache spark(Slideshare)
4. Spark Documentation
5. Putting Spark to Use: Fast In-Memory Computing for Your Big Data Applications
6. RDD
7. Persisting RDD in Spark
8. Scala on Spark function samples
9. How-to: Tune Your Apache Spark Jobs (Part 1)
10. How-to: Tune Your Apache Spark Jobs (Part 2)
11. Apache Spark Resource Management and YARN App Models
==

Posted by OpenKB at 4:02 PM

Email This BlogThis!Share to X Share to Facebook Share to Pinterest

Labels: Good Reading, Spark

No comments:

Post a Comment

Prev Page Next Page Home

Subscribe to: Post Comments (Atom)

Popular Posts

How to check JAVA memory usage

Many commands can check the memory utilization of JAVA processes, for example, pmap, ps, jmap, jstat. What are the differences? Before we ...
How to control the file numbers of hive table after inserting data on MapR-FS.

Hive table contains files in HDFS, if one table or one partition has too many small files, the HiveQL performance may be impacted. Sometime...
How to use Scala on Spark to load data into Hbase/MapRDB -- normal load or bulk load.

This article shows a sample code to load data into Hbase or MapRDB(M7) using Scala on Spark. I will introduce 2 ways, one is normal load us...
Understanding Hive joins in explain plan output

Hive is trying to embrace CBO(cost based optimizer) in latest versions, and Join is one major part of it. Understanding join best practices ...
Scala on Spark cheatsheet

This is a cookbook for scala programming. 1. Define a object with main function -- Helloworld. object HelloWorld { def main(args: Array...
How to build and use parquet-tools to read parquet files

Goal: How to build and use parquet-tools to read parquet files. Solution: 1. Download and Install maven. Follow below link: http://...
Memory allocation for Oozie Launcher job

Goal: This article explains the configuration parameters for Oozie Launcher job.
Hive on Tez : How to control the number of Mappers and Reducers

Goal: How to control the number of Mappers and Reducers in Hive on Tez.
Understand Decimal precision and scale calculation in Spark using GPU or CPU mode

Goal: This article research on how Spark calculates the Decimal precision and scale using GPU or CPU mode. Basically we will test Addition/S...
COPY into PostgreSQL fails with error "invalid byte sequence for encoding "UTF8""

Env: PostgreSQL or Greenplum Symptom: COPY from a file into a table fails with error: ERROR: invalid byte sequence for encoding ...

Search inside OpenKB.info

About OpenKB

OpenKB is just my personal technical memo to record and share knowledge.

Author: Hao Zhu

It may not be accurate, it may be out of date, it may be exactly what you want.
Any questions, please feel free to comment under the specific blog section.

San Jose,US :
Beijing,China:

Labels

Blog Archive

Followers

© 2016 OPENKB.INFO ALL RIGHTS RESERVED.