Senior/Principal Big Data Engineer - Data Modeling

Baltimore, Maryland, U.S.

Full-Time

TLDR

Design and build a big data infrastructure platform using patented ML for cross-domain data modeling, with early ownership of technology choices.

Company Description

FourV Systems, LLC is a global leader in data analytics empowering enterprises to derive more knowledge and intelligence by correlating historical and real-time data. Our products bridge the gap between scheduled analytics of stored data and real-time streaming analytics on big, fast data sets. We apply patented proprietary machine learning algorithms and natural language processing capabilities to unlock the power of stored and streaming cross-domain data in transformative ways.

Are you a software developer with big, fast data experience looking to make an impact with a rapidly growing team of talented developers working on delivering a transformative product? Are you highly motivated to make a real impact in the trajectory of our company? If so, FourV Systems is looking for developers and engineers just like you.

At FourV Systems, we believe our employees are our greatest asset. We offer competitive salaries, a full benefits package with available medical, dental, vision, and disability insurance, and a 401k retirement package. We offer paid time off, a well-stocked kitchen, and team building activities. Most importantly, we offer the opportunity to build a world class product with a team of talented engineers. At FourV Systems, we help our partners Know More, Now™.

Job Description

In the Senior/Principal Big Data Engineer – Data Modeling role, you will be responsible for participating in the design and development efforts of our big data infrastructure platform using patented machine learning algorithms in coming up with innovative ideas for cross-domain correlation and modeling of large amounts of data.

This is a great opportunity to join early and influence the technology selection, design, and have ownership of the infrastructure.

Key Responsibilities

Generate software designs and determine levels of effort for new product features
Develop software written in Java and other languages as needed
Develop the modeling layer of the GreySpark platform
Work closely with QA and other software engineers on Agile teams to deliver quality features
Participate in architecture discussions to address long-term design objectives

Skills & Experience

5+ years of software development experience in a Unix/Linux environment
Application of data modeling in multiple paradigms, including object-oriented, relational, document, and key-value
Experience with many data input formats and representations, including binary serialization formats such as Avro, Thrift, or Protobufs, and textual documents, CSVs, and other structured or semi-structured records
Experience building distributed system products including key-value or column-oriented NoSQL databases such as Casandra, Redis, or MemSQL
Hadoop environment experience – preferably Hadoop 2 with YARN – and familiarity with CDH, MapR and/or HortonWorks deployments
Message queue operation and integration experience, such as Apache Kafka, RabbitMQ, or Nifi
Familiarity with data structure optimization for execution of distributed algorithms
Full software development life-cycle experience with proven track record of shipping quality software products on-time
Experience with public, private, and hybrid cloud implementations preferably AWS or Azure
Apache Spark experience – including production experience using SparkSQL and Spark Streaming or Apache Storm a strong plus
Four year degree in CS, CE, EE, or other technical field such as math, physics, or engineering is required – a graduate degree in CS, CE, EE is a plus.