Senior Data Engineer
Spokeo
About Spokeo
Join our mission to make the world more transparent with data.
Spokeo is a people search engine that helps over 18 million monthly visitors reconnect with friends, reunite with families, and protect against fraud. Additionally, our 12 billion records and over 250 million unique profiles help business professionals locate people and assets, research criminal investigation subjects, and more.
Founded in 2006, we have grown to a remote-first company of nearly 200 dedicated employees with an average tenure of 4.5 years. Find out why we were named a “Best Company” for 2023 by Comparably for Women, Compensation, Happiest Employees, Company Perks & Benefits, and Work-Life Balance, as well as “Best CEO” for co-founder Harrison Tang.
About this Opportunity
As a Senior Data Engineer at Spokeo, you will develop, optimize, and improve our data systems such as ETL data, pipeline, storage, and entity resolution. This involves working with infrastructure built in AWS, including Airflow, PySpark, EMR, S3, DynamoDB, and more. This role will help build and improve data products, automation platform features, analytical software packages, and data pipeline orchestration tools.
What you’ll do:
Build infrastructure and data automation pipelines for the ingestion, processing, and loading of data from various sources. Automate and integrate new components into the data pipeline.
Collaborate with stakeholders and data science teams to develop data products including entity resolution and best selection to efficiently execute product vision and strategy in alignment with organizational goals and priorities.
Create unit and stress test components to monitor technical performance and ensure identified issues are resolved.
Develop data analysis tools to provide data insights and capture key metrics.
Research solutions and maintain technical documentation.
Follow best practices for data governance, quality, cleansing, and other ETL-related activities.
Who You Are:
7+ years of development experience in data engineering within a production environment (internships and academic settings excluded).
Proven experience working with large datasets exceeding 100M+ records or multiple terabytes.
2+ years of development experience in highly scalable, distributed systems and cluster architectures using AWS.
5+ years of hands-on programming experience with Python.
5+ years of professional experience working in big data ecosystems, Spark is required; PySpark is preferable.
3+ years experience with SQL, schema design, and dimensional data modeling.
2+ years of professional experience working with dataflow orchestration tools, such as Airflow.
2+ years experience with non-relational databases (e.g., DynamoDB, Elasticsearch, etc.).
Prior experience working with large data sets (>100M+ records).
Bachelor’s degree in Computer Science, Information Systems, Mathematics, or a related field is required.
Working at Spokeo
Our mission is to advance transparency, and to achieve that goal, we rally around six core values: listening with empathy, understanding the why, clarifying with data, innovating to learn, collaborating to achieve, and insisting on quality.
As a remote-first company, we are able to hire team members residing in the following US states: AZ, CA, CO, FL, GA, KY, MD, MI, MO, NH, NJ, NV, NC, PA, SC, SD, TX, VA, WA or WY.
In addition to a highly competitive base salary, our generous benefits include:
participation in an individual annual bonus
stock options
401K
100% medical/dental/vision coverage
unlimited PTO
mental health resources
paid home office equipment
fitness reimbursements
support paying for courses
and more
We extend written offers to candidates who successfully complete their selection process. Offers will depend on several factors, including, but not limited to, marketplace competition, job leveling, experience, and skills.
Privacy Notice for Candidates: https://www.spokeo.com/recruiting-policy
Spokeo is an equal opportunity employer. Applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability, or protected veteran status. Spokeo fosters a business culture where ideas and decisions from all people help us grow, innovate, create the best products, and be relevant in a rapidly changing world.
Note: You must be authorized to work for any employer in the U.S. We are unable to sponsor or take over sponsorship of one’s employment visa at this time.
Recruiters or staffing agencies: Spokeo is not obligated to compensate any external recruiter or search firm who presents a candidate or their resume or profile to a Spokeo employee without 1) a current, fully executed agreement on file, and 2) being assigned to the open position (as a search) via our applicant tracking solution.
#LI-Remote