Role: Data Engineer II
Location: Seattle, WA,US
Duration: 9 Months
Job Description:
We’re looking for Senior Data Engineer to help us grow our Data Lake and Data Warehouse Systems, which is being built using a serverless architecture, with 100% native AWS components including Redshift Spectrum, Athena, S3, Lambda, Glue, EMR, Kinesis, SNS, CloudWatch and more! We own a world-class data lake that is used to drive multi-billion dollar decisions on a regular cadence and we're looking to improve on filling the lake quickly, with as little human intervention needed and democratize the data in the lake.
Our Data Engineers build the ETL and analytics solutions for our internal customers to answer questions with data and drive critical improvements for the business. Our Data Engineers use best practices in software engineering, data management, data storage, data compute, and distributed systems. We are passionate about solving business problems with data!
Required:
Develop and maintain automated ETL pipelines (with monitoring) using scripting languages such as Python, Spark, SQL and AWS services such as S3, Glue, Lambda, SNS, SQS, KMS.
Implement and support reporting and analytics infrastructure for internal business customers.
Develop and maintain data security and permissions solutions for enterprise scale data warehouse and data lake implementations including data encryption and database user access controls and logging.
Develop data objects for business analytics using data modeling techniques.
Develop and optimize data warehouse and data lake tables using best practices for DDL, physical and logical tables, data partitioning, compression, and parallelization.
Develop and maintain data warehouse and data lake metadata, data catalog, and user documentation for internal business customers.
Work with internal business customers and software development teams to gather and document
requirements for data publishing and data consumption via data warehouse, data lake, and analytics solutions.
BASIC QUALIFICATIONS
5+ years of data engineering experience
Experience with data modeling, warehousing and building ETL pipelines
Experience with SQL
Experience in at least one modern scripting or programming language, such as Python, Java, Scala, or NodeJS
Experience mentoring team members on best practices
Preferred:
Past industry preference? – data center field
Experience with big data technologies such as: Hadoop, Hive, Spark, EMR
Experience operating large data warehouses
Top 3 must-have hard skills
Cloud Technologies - 3yrs
Python – 2yrs
SQL – 3yrs
If this sounds like a fit or you’d like to learn more, I’d love to set up a
quick call at your convenience.
Looking forward to hearing from you!