Software Engineer, Data Infrastructure Engineering
Facebook is seeking a Software Engineer to join the Data team. The ideal candidate will dream about distributed systems for the parallel processing of massive quantities of data, be familiar with Hadoop/Pig/HBase and MapReduce/Sawzall/Bigtable, and frequently think to themselves, 'Yeah, that works for 500 MB of data; what about 500 TB?' This position is full-time and based in our Menlo Park office.
Responsibilities
Develop reusable tools for the management and manipulation of hundreds of terabytes of data
Analyze and improve efficiency, scalability, and stability of data collection, storage, and retrieval processes
Interface with other teams to incorporate their innovations and vice versa
Work closely with data analysts to construct creative solutions for their analysis tasks
Optimize our infrastructure at both the software and hardware level
Focus on scaling horizontally rather than vertically
Requirements
Extensive experience working with large data stores
Deep understanding of storage appliance and storage software internals
Expert knowledge developing and debugging in C/C++ or Java on *nix
Experience developing with Python, Ruby, or PHP
Experience with filesystems, concurrency, multithreading, server architectures, and distributed systems
Menlo Park, CA 94025
Full Time