Data/AI (DA)
DCWF 624
Data Operations Specialist
Builds, manages, and operationalizes data pipelines.
Tasks
The concrete work activities defined for this role in the DCWF v5.1 spreadsheet. Core tasks are required for the role; additional tasks are associated but not mandatory.
- T400A additional Implement data management standards, requirements, and specifications.
- T520A additional Implement data mining and data warehousing applications.
- T520B additional Develop and implement data mining and data warehousing programs.
- T543 additional Develop secure code and error handling.
- T5550 additional Program custom algorithms.
- T5841 additional Advise higher level leadership on critical data management issues.
- T5844 additional Apply data acquisition, cleaning, transformation, and ingestion best practices for machine learning data conduits.
- T5846 additional Assess and address the limitations of methods to deliver data.
- T5850 additional Assist integrated project teams to identify, curate, and manage data.
- T5852 additional Build automated data management conduits.
- T5854 additional Collaborate with appropriate personnel to address Personal Health Information (PHI), Personally Identifiable Information (PII), and other data privacy and data reusability concerns for AI solutions.
- T5857 additional Comply with data classification and handling requirements through access control and security best practices.
- T5896 additional Maintain current knowledge of advancements in DoD AI Ethical Principles and Responsible AI.
- T5899 additional Manipulate and clean large, disparate datasets for bulk analysis to identify connections.
- T6470 additional Read, interpret, write, modify, and execute simple scripts (e.g., PERL, VBS) on Windows and UNIX systems (e.g., those that perform tasks such as: parsing large data files, automating manual tasks, and fetching/processing remote data).
- T702 additional Manage the compilation, cataloging, caching, distribution, and retrieval of data.
- T764 additional Perform secure programming and identify potential flaws in codes to mitigate vulnerabilities.
- T858B additional Record and manage test data.
Knowledge, Skills, and Abilities
KSA statements define what a person filling this role knows or can do. "Knowledge" is what they must know, "Skill" is what they can perform, and "Ability" is a durable capacity they bring to the work.
- A6060 ability core Ability to collect, verify, and validate test data.
- K0028 knowledge core Knowledge of data administration and data standardization policies and standards.
- K0031 knowledge core Knowledge of data mining and data warehousing principles.
- K0032 knowledge core Knowledge of database management systems, query languages, table relationships, and views.
- K0104 knowledge core Knowledge of query languages such as SQL (structured query language).
- K1128 knowledge core Knowledge of Java-based database access application programming interface (API) (e.g., Java Database Connectivity [JDBC]).
- K1128A knowledge core Knowledge of database access application programming interfaces (APIs) (e.g., Java Database Connectivity [JDBC]).
- K6300 knowledge core Knowledge of how to utilize Hadoop, Java, Python, SQL, Hive, and PIG to explore data.
- K7017 knowledge core Knowledge of data operations (DataOps) processes and best practices.
- K7019 knowledge core Knowledge of data security roles and responsibilities.
- K7029 knowledge core Knowledge of how to collect, store, and monitor data.
- S179B skill core Skill in establishing data security controls.
- S186 skill core Skill in developing data dictionaries.
- S3722 skill core Skill in data mining techniques (e.g., searching file systems) and analysis.
- S6520 skill core Skill in data pre-processing (e.g., imputation, dimensionality reduction, normalization, transformation, extraction, filtering, smoothing).
- S6610 skill core Skill in performing format conversions to create a standard representation of the data.
- S6690 skill core Skill in transformation analytics (e.g., aggregation, enrichment, processing).
- S6730 skill core Skill in using data mapping tools.
- S6760 skill core Skill in writing scripts using R, Python, PIG, HIVE, SQL, etc.
- S7062 skill core Skill in developing and maintaining automation scripts.
- S7066 skill core Skill in identifying data acquisition, collection, and curation risks.
- K0942 knowledge additional Knowledge of the organization's core business/mission processes.
- K1034A knowledge additional Knowledge of Personally Identifiable Information (PII) data security standards.
- K1034C knowledge additional Knowledge of Personal Health Information (PHI) data security standards.
- K7010 knowledge additional Knowledge of container orchestration and resource management platforms.
- K7020 knowledge additional Knowledge of DoD AI Ethical Principles (e.g., responsible, equitable, traceable, reliable, and governable).
- K7025 knowledge additional Knowledge of how AI solutions integrate with cloud or other IT infrastructure.
- K7028 knowledge additional Knowledge of how to automate development, testing, security, and deployment of AI/machine learning-enabled software.
- K7036 knowledge additional Knowledge of laws, regulations, and policies related to AI, data security/privacy, and use of publicly procured data for government.