On This Page

Crash Course in Data Science for Hackers

Booz | Allen | Hamilton | July 30-31 & August 1-2



Overview

This interactive course will teach network security professionals how to use data science techniques to quickly write scripts to manipulate and analyze network data. Students will learn techniques to rapidly write scripts to improve their work. Participants will learn now to read in data in a variety of common formats then write scripts to analyze and visualize that data. A non-exhaustive list of what will be covered include:

  • How to write scripts to read CSV, XML, and JSON files
  • How to quickly parse log files and extract artifacts from them
  • How to make API calls to merge datasets
  • How to use the Pandas library to quickly manipulate tabular data
  • How to effectively visualize data using Python
  • How to apply simple machine learning algorithms to identify potential threats

Finally, we will introduce the students to cutting edge Big Data tools including Apache Spark and Apache Drill, and demonstrate how to apply these techniques to extremely large datasets.

Who Should Take this Course

Anyone who wishes to incorporate automated data analysis into their work.

Student Requirements

Students will need to have a basic understanding of Python.

What Students Should Bring

Students should bring a laptop with either:
1. Virtualbox (or VMWare) installed, 4GB of RAM and 10GB of storage.
2. Anaconda and IPython installed.

What Students Will Be Provided With

A preconfigured virtual machine (VM) containing all the software needed for the class. The VM will also contain:
  • All course slides, handouts and documentation
  • Skeleton code examples for in-class exercises

Students will also be provided with access to our website which will have additional exercises.

Trainers

Charles Givre is an unapologetic data geek who is passionate about helping others learn about data science and become passionate about it themselves. He has worked at Booz Allen Hamilton for the last five years as a data scientist for various government clientsand done some really neat data science work along the way, which hopefully saves U.S. taxpayers some money. Most of his work has been in developing meaningful metrics to assess how well the workforce is performing. For the last two years, Charles has been part of the management team for one of the company's largest analytic contracts. His responsibility has been to increase the amount of data science on the contract, both in terms of tasks and people. Even more than the data science work, he loves learning about new technologies and techniques, and then teaching them. Charles has been instrumental in bringing Python scripting to his government clients, as well as to the analytic workforce. He has developed a 40-hour Introduction to Analytic Scripting class for that purpose. Additionally, he's developed and taught a 60-hour Fundamentals of Data Science class, which helps to put analysts on the data on-ramp. He's taught the class to Booz Allen staff, government civilians, and U.S. military personnel around the world. Charles has a Master's degree from Brandeis University, two Bachelor's degrees from the University of Arizona, and various IT security certifications. In his nonexistent spare time, he plays trombone, spends time with his family, and works on restoring British sports cars.

Emily Schumm is a life-long disciple of the liberal arts who loves data science as yet another tool she can use to win arguments. She has been with Booz Allen Hamilton for the last three years, and along the way, has worked problems for a variety of clients in the Cyber domain, whose solutions range from simple automation to cloud analytics to machine learning algorithms. But teaching is the work she loves most, and her work with Booz Allen has enabled her to teach multiple classes about Python and Fundamentals of Data Science to not only my peers but also women in the technical community. Ms. Schumm holds a Bachelor of Science in Mathematical Decision Sciences from the Department of Statistics and Operations Research at the University of North Carolina at Chapel Hill. Ms. Schumm has spent countless hours in the classroom teaching scripting, data munging, and data science skills to corporate professionals, military personnel and government civilians. When Ms. Schumm is not nerding out, she is busy celebrating her youth around the city, which entails leisurely bike rides, a cappella performances, and spontaneous dance-offs.