Search results for "Spark"

Open Knowledge Base

Home
Hadoop
MapR
Apache Hive
Apache Drill
Apache Spark
Cloudera Impala
JAVA
OS
[Follow ME]

Showing posts with label Spark. Show all posts

Showing posts with label Spark. Show all posts

Tuesday, July 19, 2022

Spark writing to S3 failed: java.lang.NoSuchMethodError: com.google.common.base.Preconditions.checkArgument

Spark writing to S3 failed: java.lang.NoSuchMethodError: org.apache.hadoop.util.SemaphoredDelegatingExecutor.

Thursday, September 23, 2021

How to access Azure Open Dataset from Spark

Monday, May 3, 2021

Understand Decimal precision and scale calculation in Spark using GPU or CPU mode

Thursday, April 29, 2021

How to use Spark Operator to run Spark job with Rapids Accelerator

Tuesday, April 27, 2021

Rapids Accelerator compatibility related to spark.sql.legacy.parquet.datetimeRebaseModeInWrite

Tuesday, April 20, 2021

Spark Code -- Dig into SparkListenerEvent

How to use latest version of Rapids Accelerator for Spark on EMR

Monday, April 12, 2021

How to use NVIDIA Nsight Systems to profile a Spark on K8s job with Rapids Accelerator

Thursday, April 8, 2021

How to use NVIDIA Nsight Systems to profile a Spark job on Rapids Accelerator

Sunday, April 4, 2021

How to enable GpuKryoRegistrator on RAPIDS Accelerator for Spark

Thursday, March 25, 2021

concat_ws example on Spark with RAPIDS Accelerator

Wednesday, March 24, 2021

Hands-on native cuDF Pandas UDF

Tuesday, March 23, 2021

How to run the pandas cudf_udf test for RAPIDS Accelerator for Apache Spark

Friday, March 19, 2021

Understanding RAPIDS Accelerator For Apache Spark's supported timezone

Thursday, March 18, 2021

Spark Tuning -- Adaptive Query Execution(3): Dynamically optimizing skew joins

What Dataset API is not supported for RAPIDS Accelerator for Apache Spark

Wednesday, March 17, 2021

Spark Tuning -- Adaptive Query Execution(2): Dynamically switching join strategies

Tuesday, March 16, 2021

Spark Tuning -- Adaptive Query Execution(1): Dynamically coalescing shuffle partitions

Monday, March 15, 2021

Spark Tuning -- Dynamic Partition Pruning

Subscribe to: Posts (Atom)

Popular Posts

How to check JAVA memory usage

Many commands can check the memory utilization of JAVA processes, for example, pmap, ps, jmap, jstat. What are the differences? Before we ...
How to control the file numbers of hive table after inserting data on MapR-FS.

Hive table contains files in HDFS, if one table or one partition has too many small files, the HiveQL performance may be impacted. Sometime...
How to use Scala on Spark to load data into Hbase/MapRDB -- normal load or bulk load.

This article shows a sample code to load data into Hbase or MapRDB(M7) using Scala on Spark. I will introduce 2 ways, one is normal load us...
Understanding Hive joins in explain plan output

Hive is trying to embrace CBO(cost based optimizer) in latest versions, and Join is one major part of it. Understanding join best practices ...
Scala on Spark cheatsheet

This is a cookbook for scala programming. 1. Define a object with main function -- Helloworld. object HelloWorld { def main(args: Array...
How to build and use parquet-tools to read parquet files

Goal: How to build and use parquet-tools to read parquet files. Solution: 1. Download and Install maven. Follow below link: http://...
Memory allocation for Oozie Launcher job

Goal: This article explains the configuration parameters for Oozie Launcher job.
Hive on Tez : How to control the number of Mappers and Reducers

Goal: How to control the number of Mappers and Reducers in Hive on Tez.
Understand Decimal precision and scale calculation in Spark using GPU or CPU mode

Goal: This article research on how Spark calculates the Decimal precision and scale using GPU or CPU mode. Basically we will test Addition/S...
COPY into PostgreSQL fails with error "invalid byte sequence for encoding "UTF8""

Env: PostgreSQL or Greenplum Symptom: COPY from a file into a table fails with error: ERROR: invalid byte sequence for encoding ...

Search inside OpenKB.info

About OpenKB

OpenKB is just my personal technical memo to record and share knowledge.

Author: Hao Zhu

It may not be accurate, it may be out of date, it may be exactly what you want.
Any questions, please feel free to comment under the specific blog section.

San Jose,US :
Beijing,China:

Labels

Blog Archive

Followers

© 2016 OPENKB.INFO ALL RIGHTS RESERVED.