If you are a Data Engineer with advanced SQL and scripting experience, please read on!
Job Title: Data Engineer - Python & SQL
Job Location: Pittsburgh, PA - East Liberty - remote until we return to the office in September
Compensation: $100K - $150K+ base salary depending experience
Our company empowers America's defense leaders by providing decision-grade information through our purpose-built data science and analytics platform. Leveraging our database of record, we apply machine driven processes and proprietary algorithms to answer vexing questions and advance the national defense strategy. This position will start out remote and then be based out of our Pittsburgh, PA office once we return to the office (September at the earliest as of now).
We are passionate about data, we are passionate about code, and we are passionate about the product we offer. We are seeking an exceptional and experienced data engineer who shares our passion and obsession with quality. You'll be a core member of our product team dedicated to helping our customers displace time-consuming, manual processes to reach informed real-time decisions about government markets, competitors, and agency relationships.
Top Reasons to Work with Us
- Competitive Compensation ($100K - $150K+ base salary DOE) plus equity!
- Work for a VC-backed Big Data startup with a great upside potential
- Join a company with a distinguished Board of Directors - including 2 esteemed Google engineering execs!
- Work with multiple cutting-edge Big Data open source technologies
- Job Satisfaction - Opportunity to join the core team at a pivotal growth stage and influence the company's future!
What You Will Be Doing
- Define and lead our data life-cycle strategy across data transformation, data ingestion and data consistency.
- Identify data sources, assess their value and quality and estimate the level of effort required to integrate into existing data model, infrastructure and products.
- Ensure key entities within data sets are identified, resolved and linked to existing entities within the current master data repository.
- Apply various techniques to produce solutions to large-scale optimization problems, including data pre-processing, indexing, blocking, field and record comparison and classification.
- Develop, refine and oversee master data management standards, including establishing and enforcing governance procedures and ensuring data integrity across multiple functions. Responsible for owning data quality metrics and meeting defined data accuracy goals according to industry best practices.
- Improve data sharing, increase data re-purposing and improve cost efficiency associated with data management efforts.
- Build best practices that help with chain of custody of data so it can be easily traced back to the source for accuracy and consistency.
- Work across functional teams to understand advanced statistical, machine learning, and text processing models. Incorporate them into our existing data engineering infrastructure.
- Perform exploratory data analyses, generate and test working hypotheses, prepare and analyze historical data and identify patterns.
- Work directly with users as well as SMEs to establish, create and populate optimal data architectures and structures, as well as articulate techniques and results using non-technical language.
What You Need for this Position
Bachelor's Degree in Computer Science, Mathematics, or similar with 3+ years experience with the following:
- Advanced SQL programming
- Advanced Scripting experience (Python, Ruby, Perl or similar)
- Extensive Linux/Unix experience
- PostgreSQL, MySQL or similar RDBMS
- Working with multiple distributed systems/disparate data sets
- Data pipeline management (Master Data Management (MDM) - linking of data sets
- Proficient usage of common data formats such as CSV, XML, and JSON
Preferred:
- Strong Python or similar scripting experience
- Amazon Web Services (AWS)
- Master Data Management (MDM) experience including data consolidation, linkage, federation, and dissemination
- Experience in/exposure to the nuances of a startup or other entrepreneurial environment
What's In It for You
- Competitive Compensation ($100K - $150K+ base depending on experience)
- We pay for 100% of employee premiums and 90% of dependent premiums (through United Healthcare)
- Vision, dental, STD, LTD, AD&D and life insurance
- Unlimited/flexible PTO policy
- Casual work environment - jeans casual!
- 401k with company match
- Equity!
So, if you are a Data Engineer with experience, please apply today! or send an updated copy of your resume to Mike.Vandenbergh@CyberCoders.com for immediate consideration!
For this position you must be currently authorized to work in the United States. We do not sponsor for this position.