
Entry-Level Data Engineer



Category

Information Technology

Job Location

Remote

Tracking Code

083021

Position Type

Full-Time/Regular

About Us

Certilytics provides sophisticated predictive analytics solutions to major healthcare organizations by integrating financial, clinical, and behavioral insights. Our multidisciplinary team includes actuarial, data, and behavioral scientists, IT engineers, software developers, and nurse clinicians, as well as experts in public health and the health insurance industry. Certilytics has extensive experience working with a diverse set of customers, including large self-insured employers, health plans, pharmacy benefit managers, government programs, care management companies, and health systems. These relationships with various data providers and customers allow for rapid data ingestion, validation, and enrichment, as well as streamlined delivery of analytic dashboards, outputs, and visualizations to our customers. Our unique approach allows for the development of the most accurate financial, clinical, and behavioral models in the industry.

The Role

We’re seeking an entry-level data engineer to join the experts on Certilytics’ Data Science team. The data engineer is responsible for expanding, optimizing, and troubleshooting our data pipelines, which receive, ingest, process, and extract data to serve both internal and external needs. This position will support our operational and business objectives, working with data analysts on existing and new data initiatives. As part of the Data Science team, you will focus on expanding our capabilities in managing and integrating new and existing data sources. You should be self-directed and comfortable supporting the data needs of multiple teams, systems, and products.

 


Required Skills

What You'll Accomplish

  • Monitor the state of the database used by the Data Science team to ensure data correctness and availability.
  • Manage the movement of data between environments to support model development.
  • Gather and collate new data sources for use by the Data Science team.
  • Create and maintain data pipelines to feed into our data lake.
  • Optimize and troubleshoot complex joins across massive data sets (billions of records).
  • Identify, design, implement, and own internal process improvements, such as automating manual processes and optimizing data delivery and ingestion.
  • Respond to operational data pipeline failures with ownership and accuracy.
  • Assemble large, complex data sets that meet business requirements.
  • Evaluate logs across a wide range of complex systems.
  • Work with data experts to improve the functionality of our data platform.
  • Generate accurate and effective documentation.

Required Experience

What You’ll Need

  • Ideal for a new graduate or someone with 1-2 years of industry experience.
  • Demonstrable working knowledge of SQL and experience working with a variety of databases.
  • Successful history of manipulating, transforming, and extracting value from large data sets.
  • Strong project management and organizational skills.
  • Knowledge of a programming language (Python and Java are great!).
  • Ability to support and work with cross-functional teams in a dynamic environment.
  • Excellent communication (both written and verbal) combined with a strong collaborative drive.
  • Experience designing, building, and maintaining ‘big data’ data pipelines.
  • Ability to function autonomously and to communicate with partners and teammates across several time zones.
  • Technical mindset and desire to learn new tools and approaches.
  • Experience working with healthcare data is a plus.
  • Experience with big data tools (Hadoop, Hive, Spark, YARN) is a plus.

How to stand out!

  • Master’s degree in Computer Science.
