big data

Using Spark Dynamic Allocation

Igor Berman | 24 Jun 2020 | Big Data

Tags: big data, dynamic allocation, infra, mesos, performance, Spark

The story starts with metrics. Every mature software company needs to have a metric system to monitor resource ...

Collaborative Trial: On Optimizing Recommendation Testing

Maoz Cohen | 09 Jun 2020 | Big Data

Tags: a/b testing, algorithms, big data, data, data science, Monitoring, performance, statistics, testing

Taboola is responsible for billions of daily recommendations, and we are doing everything we can to make those ...

Exploiting Multi-Categorical Features Using Deep Interest

Marina Gandlin | 04 Sep 2019 | Data Science

Tags: algorithms, big data, data, data model, data science, deep learning, machine learning, neural networks

At Taboola, our goal is to predict whether users will click on the ads we present to them.

Bucket the shuffle out of here!

Igor Berman and Radik Komarnitsky | 28 Mar 2019 | Big Data

Tags: big data, data, performance, shuffles, Spark, Spark-on-demand, tips

At Taboola we use Spark extensively throughout the pipeline. Regularly faced with Spark-related scalability challenges, we look ...

How I Resolved Delays in Kafka Messages by Prioritizing Kafka Topics

Gal Shelach | 17 Oct 2018 | Big Data

Tags: big data, java, kafka, tips

As a team member in the Scale Performance Data group of Taboola’s R&D, I had the opportunity to ...