Recommendations

Apache Spark

Memory management
- APACHE SPARK : MEMORY MANAGEMENT AND GRACEFUL DEGRADATION
- Project Tungsten: Bringing Spark Closer to Bare Metal
  1. Memory Management and Binary Processing: leveraging application semantics to manage memory explicitly and eliminate the overhead of JVM object model and garbage collection.
  2. Cache-aware computation: algorithms and data structures to exploit memory hierarchy.
  3. Code generation: using code generation to exploit modern compilers and CPUs.
- Deep Dive: Memory Management in Apache Spark
  - Tuning Java Garbage Collection for Spark Applications
Performance
- How-to: Tune Your Apache Spark Jobs (Part 1), How-to: Tune Your Apache Spark Jobs (Part 2)
- Top 3 Troubleshooting Tips To Keep You Sparking
Tutorials and Courses
Others

Streaming

MICROBENCHMARKING APACHE STORM 1.0 PERFORMANCE
High-throughput, low-latency, and exactly-once stream processing with Apache Flink
Comparison of Apache Stream Processing Frameworks: Part 1, Part 2
Carrier Payments Big Data Pipeline using Apache Storm
Introduction to Apache Flink
Throughput, Latency, and Yahoo! Performance Benchmarks. Is there a winner?

Java

Python

Understanding Python Decorators in 12 Easy Steps

Distributed Systems

Machine Learning

Git

About Trajectory Data

All 1.1 Billion Taxi Rides on Redshift
Monitoring Real-Time Uber Data Using Spark Machine Learning, Streaming, and the Kafka API: Part 1, Part 2
DETECTING ABUSE AT SCALE: LOCALITY SENSITIVE HASHING AT UBER ENGINEERING

Insights

Tech Stack

THE UBER ENGINEERING TECH STACK, PART I: THE FOUNDATION
THE UBER ENGINEERING TECH STACK, PART II: THE EDGE AND BEYOND
How Uber Uses Spark and Hadoop to Optimize Customer Experience
PayPal From Big Data to Fast Data: Part 1, Part 2

Algorithms

Algorithms and Data Structures

Open Source

Serverless

- Serverless on Kubernetes with Soam Vasani

Management

What to do when your startup needs program management

Incredible Products/Sites

Other Materials