Job Title:
AWS Data Lake Technical Lead - REMOTE with Vocational Onsite Visits at Waterbury, VT (Travel Expenses will be reimbursed)

Company: My3Tech

Location: Miami, FL

Created: 2024-05-13

Job Type: Full Time

Job Description:

Job Title: AWS Data Lake Technical Lead
Job Location: REMOTE with Vocational Onsite Visits at Waterbury, VT (Travel Expenses will be reimbursed)
Duration: 3+ Years Contract

Background:
Client is seeking to improve statewide law enforcement data access. The purpose is to design and implement a state-controlled system to access de-identified, aggregated law enforcement and related data currently housed in a record management system.

Client, in collaboration with the Agency of Digital Services (ADS), is seeking to procure Amazon Web Services (AWS) professional services to work with the ADS Technical Lead to build out the Public Services Lakehouse environment.

Existing Technology Environment:
Client currently uses a Computer Aided Dispatch Records Management System (CAD RMS) running in a MySQL instance, which will be used in the first phase of the Data Lake build. The supplemental data comes from SQL Server instances and flat file sources that are currently housed internally in either SharePoint or internal file stores.

Requirements:
- Design and implement the Client's Data Lake in the AWS environment.
- Store data in an AWS CJIS-compliant environment.
- Implement and design the lake house technologies with the IT Tech Lead assigned to the project.
- Design and implement using the latest AWS Lake House standards.

Data Security Layer:
- Implement and design security Identity and Access Management (IAM) roles and processes.
- Design and create security IAM templates.

Data Ingestion Layer:
- Ingestion design for a variety of sources:
  - Operational database sources (MySQL, SQL Server)
  - SaaS applications
  - File shares (SharePoint and OneDrive)
  - Stream data sources
- System templates of ingestion processes.

Data Storage Layer:
- CJIS-compliant AWS S3 buckets
- AWS Redshift infrastructure
- Power BI connector process

Data Processing Layer:
- Data Extract, Load and Transform (ELT) process for loading from source to S3 and transforming data from S3 to Redshift for reporting and analytics.
- Templates for creating future ELT processes.

Data Catalog Layer:
- Design and implement a solution to solve data schema drift in AWS Glue for use with reporting and analytical needs.
- Design and build crawlers for schema, and build a catalog that stores schema information.
- Metadata store in the catalog for consumption in the data warehouse.
- Create catalog crawler templates for data sources.
- Create templates for expansion of the data lake for future agencies.
- Base data warehouse environment implemented for reporting dashboard use.
- Template designs for future data Lakehouse implementations.
- Lake House design needs to be usable by Power BI.
- Design each layer for scaling based on usage.

Professional Service Requirements:
- AWS Lakehouse Certification
- AWS Design and Implementation Lake House Technologies
- AWS IAM
- AWS Lakehouse Technology
- AWS Glue Knowledge
- Data Catalog and Crawler
- AWS Redshift
- AWS Athena
- CJIS certified data storage
- AWS CJIS Data location
- CJIS Security Background Check (See Additional Attachments Assurances)