logo

Spark-on-demand

Bucket the shuffle out of here!

Igor Berman and Radik Komarnitsky | 28 Mar 2019 | Big Data

Tags: big data, data, performance, shuffles, Spark, Spark-on-demand, tips

Intro At Taboola we use Spark extensively throughout the pipeline. Regularly faced with Spark-related scalability challenges, we look Read More...

Most Popular

  1. 5 Simple tips for boosting your Jenkins performance
  2. More than one Graph - Code Reuse in TensorFlow
  3. Predicting Probability Distributions Using Neural Networks
  4. Deep Multi-Task Learning - 3 Lessons Learned
  5. Deep Learning - from Prototype to Production

Categories

  • Big Data
  • Culture
  • Data Science
  • Java
    • Concurrency
  • Javascript
  • Machine Learning
  • System
  • Tips and Tricks
  • Tutorials
  • Uncategorized
  • Web Development

Archive

  • 2020 (11)
    • Nov (1)
    • Oct (1)
    • Sep (1)
    • Aug (2)
    • Jul (1)
    • Jun (2)
    • May (2)
    • Feb (1)
  • 2019 (29)
    • Dec (3)
    • Nov (1)
    • Oct (1)
    • Sep (3)
    • Aug (5)
    • Jul (2)
    • Jun (2)
    • May (4)
    • Mar (1)
    • Feb (2)
    • Jan (5)
  • 2018 (22)
    • Dec (5)
    • Nov (4)
    • Oct (3)
    • Sep (1)
    • Aug (3)
    • Jul (1)
    • Jun (1)
    • May (1)
    • Apr (1)
    • Mar (1)
    • Feb (1)
  • 2017 (9)
    • Dec (1)
    • Nov (1)
    • Oct (1)
    • Sep (1)
    • Jul (2)
    • Jun (3)

Join Our Team of Tech Heroes

Search Jobs

About Taboola

Taboola is a world leader in data science and machine learning and in back-end data processing at scale. We specialize in advanced personalization, deep learning and machine learning. We have a large-scale data operation with over 500K requests/sec, 20TB of new data processed each day, real and semi real-time machine learning algorithms trained over petabytes of data, and more. Our worldwide reach provides every single engineer the opportunity to influence how consumers discover and consume content across the globe.
www.taboola.com / careers.taboola.com

  • Home
  • About Us