Job Number: R0069993
Guide the development of data architectures and data management systems, often where none existed before. Design and inform data collection, management, curation, and standardization processes and requirements, and ensure the interoperability of data structures in accordance with data governance policies and best practices. Coordinate with business leaders, various stakeholders, and data scientists to meet business needs. Prepare verbal and written communications incorporating strategic thinking.
-5+ years of experience with writing and maintaining extract-transform-load (ETL) processes on a variety of structured and unstructured sources, architecting, building, and maintaining scalable automated data pipelines from the ground up, and identifying ways to improve data reliability, efficiency, and quality
-3+ years of experience with developing, testing, and maintaining data infrastructures and architectures, including databases and large-scale processing systems, designing, building, and integrating data from various resources, and conforming raw data from unvalidated or system-specific sources into centralized, standardized, and quality-controlled data architectures
-2+ years of experience with preparing data for predictive and prescriptive modeling and artificial intelligence (AI) and machine learning (ML) algorithms
-2+ years of experience in working with business leaders, stakeholders, and data scientists to comprehend the organization's goals and requirements and ensuring that data management practices further those goals
-2+ years of experience with scripting and programming, including Python, R, NodeJS, or PHP
-Experience with relational databases, including SQL, queries, database definition, schema design, and database management systems such as MariaDB, SQLite, MySQL, PostgreSQL, and Oracle DB
-Experience with non-relational (NoSQL) databases, including data formats such as XML and JSON and database management systems such as Apache HBase, Cassandra, Redis, Amazon DynamoDB, Neo4j, and MongoDB
-Experience with the DoD authority to operate (ATO) and Risk Management Framework (RMF) processes
-BA or BS degree in Statistics, Operations Research, Bioinformatics, Economics, Computational Biology, CS, Mathematics, Physics, EE, or Industrial Engineering
-Experience with Big Data platforms, including Apache Hadoop, Apache Spark, Redshift, Teradata, or SAP HANA, and technology stacks, including Pig, Hive, or Oozie
-Experience defining data management and data governance policies, requirements, and strategies
-Experience leveraging Cloud-based and client-owned platforms
-Experience with Machine Learning as a Service (MLaaS) Cloud platforms
-Experience with High Performance Computing
-Knowledge of distributed database systems
-Knowledge of the development of workflows for ML or AI
-TS/SCI clearance is preferred
-MS degree in Statistics, Operations Research, Bioinformatics, Economics, Computational Biology, CS, Mathematics, Physics, EE, or Industrial Engineering
Applicants selected will be subject to a security investigation and may need to meet eligibility requirements for access to classified information; Secret clearance is required.
We're an EOE that empowers our people—no matter their race, color, religion, sex, gender identity, sexual orientation, national origin, disability, veteran status, or other protected characteristic—to fearlessly drive change.