Zach Radka

Platform Architecture

Specialties

Hadoop and Big Data Systems
Cloud Architecture
Large-scale Databases
Software Engineering

Education

M.S. Computer Engineering, Johns Hopkins University
B.S. Computer Engineering, UMBC

Location

Maryland

Recent Projects

Large Pharmaceutical Company
Large-scale Clinical Trial Data Lake on Hadoop
Project lead and platform architect for the implementation of a large-scale graph datastore built with Accumulo, Hortonworks Data Platform (Hadoop), and Amazon Web Services.

Civilian Government Agency
Greenplum Data Warehouse in AWS
Designed and implemented a large-scale Greenplum database on AWS that utilized Apache Spark to ingest data from disparate sources. Implemented and designed several ETL pipelines for this system, implemented security controls with Amazon KMS, and integrated with Splunk for logging.

Technical Expertise

  • Hadoop: Spark, Mapreduce, Ambari, Accumulo, HBase, Hive
  • AWS: EC2, Lambda, CloudFormation, RDS, SQS, KMS
  • Databases: Greenplum, Dynamo, MySQL, PostgreSQL, Oracle
  • DevOps: Jenkins, Docker, Vagrant
  • Software Engineering: Java. Python, C#, Javascript