GTC 2023: Nvidia shares how Rapids can future-proof Apache Spark

March 24, 2023


Following the initial rise of Hadoop, data teams across industries have adopted Apache Spark as the go-to framework for distributed big data processing. The open-source platform has largely replaced Hadoop’s MapReduce by processing datasets faster in memory and by handling use cases that Hadoop could not manage. Spark also offers more accessible APIs and is backed by adequate fault tolerance.

However, with the amount of data in the world predicted to grow to 221 zettabytes by 2026, it’s difficult for organizations to get a grip on the information they have. At current processing speeds, companies will face latencies in business applications like analytics. And if they move to increase speeds, the costs rise.

That’s why teams should look at the option of accelerating Spark with GPUs, via Rapids, said Sameer Raheja, senior director of engineering at Nvidia, at the ongoing GTC 2023 conference. 

GPU-accelerated Apache Spark 

To handle future data demands with Spark, Raheja suggested running the framework with Nvidia GPUs. A plugin jar like Rapids Accelerator for Apache Spark, he said, can allow Spark batch processing to run on GPUs without any code changes.

This, he said, will not only enable teams to run massive data jobs faster and at a lower cost than is possible with CPUs, but will also drive power savings.
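As a rough illustration of what "no code changes" can look like in practice, the sketch below assembles the Spark settings that load the plugin and turns them into a spark-submit command line. It is a minimal sketch under assumptions, not Nvidia's documented setup: the jar path and the job name `my_job.py` are hypothetical placeholders, and the property names (`spark.plugins`, `com.nvidia.spark.SQLPlugin`, `spark.rapids.sql.enabled`) are given as commonly documented for the plugin and should be checked against the release in use.

```python
# Sketch: build the settings that would load the Rapids Accelerator plugin.
# The application itself (SQL/DataFrame code) is left untouched.

def rapids_conf(jar_path):
    """Return Spark properties that enable the plugin (assumed names)."""
    return {
        "spark.jars": jar_path,                         # hypothetical jar location
        "spark.plugins": "com.nvidia.spark.SQLPlugin",  # plugin entry point
        "spark.rapids.sql.enabled": "true",             # turn GPU SQL acceleration on
    }

def spark_submit_args(conf, app):
    """Turn a property dict into a spark-submit argument list."""
    args = ["spark-submit"]
    for key, value in conf.items():
        args += ["--conf", f"{key}={value}"]
    args.append(app)
    return args

conf = rapids_conf("/path/to/rapids-4-spark.jar")       # placeholder path
print(" ".join(spark_submit_args(conf, "my_job.py")))   # my_job.py is hypothetical
```

The point of the sketch is that the acceleration is switched on through configuration alone; the same job can fall back to CPUs by dropping these properties.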

Rapids Accelerator for Apache Spark combines the power of the Rapids cuDF library and the scale of the Spark distributed computing framework. The Rapids Accelerator library also has a built-in accelerated shuffle based on UCX that can be configured to leverage GPU-to-GPU communication and remote direct memory access capabilities.

Using the Nvidia Decision Support (NDS) benchmark, an adaptation of the industry-standard TPC-DS benchmark with 100 modified queries, the company compared a Rapids-based, GPU-accelerated Google Cloud Dataproc Spark distribution with one based on CPUs. The GPU nodes completed a power run of all 100 queries in just 31 minutes, versus 176 minutes for the CPU nodes.

Because the GPU run took less time, it also proved cheaper than the CPU run, costing just $7.20 compared with $32.52. The GPU run was also five times more power-efficient.
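The runtime and cost figures quoted above can be turned into a speedup and a savings percentage with simple arithmetic. The short sketch below uses only the four numbers reported in the benchmark; the variable names are illustrative, and the power-efficiency figure is not derived here because the article gives no underlying power data.

```python
# Figures reported for the NDS power run on Google Cloud Dataproc.
gpu_minutes, cpu_minutes = 31, 176
gpu_cost, cpu_cost = 7.20, 32.52

# How many times faster the GPU run finished.
speedup = cpu_minutes / gpu_minutes

# Fraction of the CPU bill saved by the GPU run.
cost_savings = 1 - gpu_cost / cpu_cost

print(f"Speedup: {speedup:.1f}x")            # Speedup: 5.7x
print(f"Cost savings: {cost_savings:.1%}")   # Cost savings: 77.9%
```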

NDS benchmarks

“For anyone who’s running big data workloads and managing a budget … performance, cost and efficiency are key factors, and Rapids Accelerator for Spark addresses all three,” Raheja emphasized.

He added that similar benchmark results were seen on other clouds and Spark distributions with configurations closely matching that of Dataproc. For example, a Rapids-accelerated AWS EMR distribution saw 42% cost savings, while AWS Databricks Photon and Azure Databricks Photon delivered 39% and 34% cost savings, respectively.

Savings across different clouds

How it works

The key to these benefits is Apache Spark 3, which brings column-based processing and resource-aware scheduling of custom resources. This allows teams to schedule tasks on accelerator resources like GPUs.
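To make resource-aware scheduling more concrete, the sketch below builds the Spark 3 properties that tell the scheduler how many GPUs each executor has and what share of a GPU each task needs. It is a minimal sketch under assumptions: the helper name `gpu_scheduling_conf` and the discovery script path are hypothetical, and the values chosen (one GPU per executor, four tasks sharing it) are examples rather than recommendations from the talk.

```python
def gpu_scheduling_conf(gpus_per_executor, tasks_per_gpu, discovery_script):
    """Return Spark 3 properties for GPU-aware task scheduling."""
    if gpus_per_executor < 1 or tasks_per_gpu < 1:
        raise ValueError("both counts must be at least 1")
    return {
        # GPUs the scheduler should expect on each executor.
        "spark.executor.resource.gpu.amount": str(gpus_per_executor),
        # Share of one GPU each task claims; 0.25 lets four tasks share a GPU.
        "spark.task.resource.gpu.amount": str(1 / tasks_per_gpu),
        # Script executors run to report their GPU addresses (hypothetical path).
        "spark.executor.resource.gpu.discoveryScript": discovery_script,
    }

conf = gpu_scheduling_conf(1, 4, "/path/to/getGpusResources.sh")
for key, value in conf.items():
    print(f"{key}={value}")
```

With settings like these, Spark only places a task on an executor that still has enough unclaimed GPU share, which is what lets the plugin assume a GPU is available when the task runs.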

“You can continue to write your application in the APIs you’re familiar with — SQL, Python, R, Java and Scala. Spark provides distributed and scale-up compute power; Spark 3.x provides resource-aware scheduling; and the Rapids Accelerator for Apache Spark plugin provides transparency for applications to run on Nvidia GPUs, enabling acceleration in cooperation with [the] Spark core engine’s built-in processor,” Raheja said.

Currently, the Rapids Spark accelerator is available on and built into Amazon EMR, Cloudera CDP, Databricks ML runtime, Azure Synapse Analytics, Google Cloud Dataproc, and open-source Apache Spark 3.x distributions, either on-premises or in the cloud.

The 2023 Nvidia GTC event runs through March 23.
