Search results for "Spark"

Open Knowledge Base

Home
Hadoop
MapR
Apache Hive
Apache Drill
Apache Spark
Cloudera Impala
JAVA
OS
[Follow ME]

Showing posts with label Spark. Show all posts

Showing posts with label Spark. Show all posts

Wednesday, March 10, 2021

Understanding RAPIDS Accelerator For Apache Spark parameter -- spark.rapids.memory.gpu.allocFraction and GPU pool related ones.

Tuesday, March 9, 2021

Understanding RAPIDS Accelerator For Apache Spark parameter -- spark.rapids.memory.pinnedPool.size

Thursday, March 4, 2021

Error java.lang.NoSuchMethodException when running spark-sql-perf with Hive Metastore 3.x

Thursday, February 25, 2021

Spark Code -- Unified Memory Manager

Wednesday, February 24, 2021

Spark on GPU -- Hands on GCP Dataproc to test Spark on GPU using RAPIDS

Wednesday, February 17, 2021

Spark Tuning -- Understanding the Spill from a Cartesian Product

Tuesday, February 16, 2021

Spark Tuning -- explaining Spark SQL Join Types

Wednesday, February 10, 2021

Spark Code -- Which Spark SQL data type isOrderable?

Monday, February 8, 2021

Spark Tuning -- Understand Cost Based Optimizer in Spark

Friday, February 5, 2021

How to generate TPC-DS data and run TPC-DS performance benchmark for Spark

Thursday, February 4, 2021

Spark Tuning -- How to use SparkMeasure to measure Spark job metrics

Wednesday, February 3, 2021

Spark Tuning -- Predicate Pushdown for Parquet

Tuesday, February 2, 2021

Spark Tuning -- Column Projection for Parquet

Spark Tuning -- Use Partition Discovery feature to do partition pruning

Thursday, January 28, 2021

Spark Code -- Use date_format() to convert timestamp to String

Spark Code -- How to drop Null values in DataFrame/Dataset

Spark Code -- How to replace Null values in DataFrame/Dataset

Tuesday, February 11, 2020

How to check if Spark job runs out of quota in CSpace

Friday, January 31, 2020

Hands-on MKE(MapR Kubernetes Ecosystem ) 1.0 release

Friday, December 6, 2019

Spark Streaming sample scala code for different sources

Prev Page Next Page Home

Subscribe to: Posts (Atom)

Popular Posts

How to check JAVA memory usage

Many commands can check the memory utilization of JAVA processes, for example, pmap, ps, jmap, jstat. What are the differences? Before we ...
How to control the file numbers of hive table after inserting data on MapR-FS.

Hive table contains files in HDFS, if one table or one partition has too many small files, the HiveQL performance may be impacted. Sometime...
How to use Scala on Spark to load data into Hbase/MapRDB -- normal load or bulk load.

This article shows a sample code to load data into Hbase or MapRDB(M7) using Scala on Spark. I will introduce 2 ways, one is normal load us...
Understanding Hive joins in explain plan output

Hive is trying to embrace CBO(cost based optimizer) in latest versions, and Join is one major part of it. Understanding join best practices ...
Scala on Spark cheatsheet

This is a cookbook for scala programming. 1. Define a object with main function -- Helloworld. object HelloWorld { def main(args: Array...
How to build and use parquet-tools to read parquet files

Goal: How to build and use parquet-tools to read parquet files. Solution: 1. Download and Install maven. Follow below link: http://...
Memory allocation for Oozie Launcher job

Goal: This article explains the configuration parameters for Oozie Launcher job.
Hive on Tez : How to control the number of Mappers and Reducers

Goal: How to control the number of Mappers and Reducers in Hive on Tez.
Understand Decimal precision and scale calculation in Spark using GPU or CPU mode

Goal: This article research on how Spark calculates the Decimal precision and scale using GPU or CPU mode. Basically we will test Addition/S...
COPY into PostgreSQL fails with error "invalid byte sequence for encoding "UTF8""

Env: PostgreSQL or Greenplum Symptom: COPY from a file into a table fails with error: ERROR: invalid byte sequence for encoding "...

Search inside OpenKB.info

About OpenKB

OpenKB is just my personal technical memo to record and share knowledge.

Author: Hao Zhu

It may not be accurate, it may be out of date, it may be exactly what you want.
Any questions, please feel free to comment under the specific blog section.

San Jose,US :
Beijing,China:

Labels

Blog Archive

Followers

© 2016 OPENKB.INFO ALL RIGHTS RESERVED.