Data Flow and Data Engineering
B.S. Computer Science, UMUC
Large Health Insurance Company
Cybersecurity Data Pipeline
Developed a data pipeline with Apache NiFi that in real-time gathers 20 billion events per day from 60 different sources.
Large Financial Services Company
Large-scale Compliance Data Lake and Streaming Analytics System
Built several production Hadoop and Accumulo clusters on an on-premise cloud built on top of Mesos. Built applications that interact with and loaded data into these Accumulo clusters for a massive-scale compliance solution that services dozens of financial institutions’ email and chat data.
- Hadoop: MapReduce, Accumulo
- Apache NiFi
- Java: Spring, Tomcat, REST, Tomcat, Jetty, Jboss