Senior/Principal Big Data Engineer - Data Modeling
TLDR
Design and build a big data infrastructure platform using patented ML for cross-domain data modeling, with early ownership of technology choices.
Company Description
FourV Systems, LLC is a global leader in data analytics, empowering enterprises to derive more knowledge and intelligence by correlating historical and real-time data. Our products bridge the gap between scheduled analytics of stored data and real-time streaming analytics on big, fast data sets. We apply patented, proprietary machine learning algorithms and natural language processing capabilities to unlock the power of stored and streaming cross-domain data in transformative ways.
Are you a software developer with big, fast data experience looking to make an impact with a rapidly growing team of talented developers working on delivering a transformative product? Are you highly motivated to make a real impact in the trajectory of our company? If so, FourV Systems is looking for developers and engineers just like you.
At FourV Systems, we believe our employees are our greatest asset. We offer competitive salaries, a full benefits package with available medical, dental, vision, and disability insurance, and a 401k retirement package. We offer paid time off, a well-stocked kitchen, and team-building activities. Most importantly, we offer the opportunity to build a world-class product with a team of talented engineers. At FourV Systems, we help our partners Know More, Now™.
Job Description
In the Senior/Principal Big Data Engineer – Data Modeling role, you will participate in the design and development of our big data infrastructure platform, using patented machine learning algorithms to develop innovative approaches to cross-domain correlation and modeling of large amounts of data.
This is a great opportunity to join early, influence technology selection and design, and take ownership of the infrastructure.
Key Responsibilities
- Generate software designs and determine levels of effort for new product features
- Develop software written in Java and other languages as needed
- Develop the modeling layer of the GreySpark platform
- Work closely with QA and other software engineers on Agile teams to deliver quality features
- Participate in architecture discussions to address long-term design objectives
Skills & Experience
- 5+ years of software development experience in a Unix/Linux environment
- Application of data modeling in multiple paradigms, including object-oriented, relational, document, and key-value
- Experience with many data input formats and representations, including binary serialization formats such as Avro, Thrift, or Protobufs, and textual documents, CSVs, and other structured or semi-structured records
- Experience building distributed system products, including key-value or column-oriented NoSQL databases such as Cassandra, Redis, or MemSQL
- Hadoop environment experience – preferably Hadoop 2 with YARN – and familiarity with CDH, MapR, and/or Hortonworks deployments
- Message queue operation and integration experience, such as Apache Kafka, RabbitMQ, or NiFi
- Familiarity with data structure optimization for execution of distributed algorithms
- Full software development life-cycle experience with a proven track record of shipping quality software products on time
- Experience with public, private, and hybrid cloud implementations, preferably AWS or Azure
- Apache Spark experience, including production experience using Spark SQL and Spark Streaming, or Apache Storm experience is a strong plus
- Four-year degree in CS, CE, EE, or other technical field such as math, physics, or engineering is required – a graduate degree in CS, CE, or EE is a plus
Join our team today and Know More Now™ with FourV Systems.
Benefits
Free Meals & Snacks
well-stocked kitchen
Health Insurance
full benefits package with available medical, dental, vision, and disability insurance
401k retirement plan
401k retirement package
Paid Time Off