Job Title:
AI DevOps Engineer
Company: SmartAdvocate
Location: Melville, NY
Created: 2026-05-12
Job Type: Full Time
Job Description:
*About Us*Apply fast, check the full description by scrolling below to find out the full requirements for this role.SmartAdvocate® is a leading legal case management application. Conceptualized by the founding partner of a prominent personal injury law firm, SmartAdvocate® is offered as either a cloud-based or self-hosted application. Legal professionals use SmartAdvocate to manage pre-litigation and litigation cases including contacts, communications, case data, documents, document scanning and filing, document creation, calendaring, and more.Our clients run in hybrid environments "” on-premises Windows infrastructure and cloud-hosted deployments. The stack is Microsoft-centric: , SQL Server, IIS, and Windows Server, with Oracle Cloud Infrastructure (OCI) as our primary cloud platform.*About The Role*We are looking for a hands-on AI DevOps Engineer to own and build out the operational backbone of our legal case management platform. You will be the go-to person for infrastructure across traditional systems and modern AI workloads, including LLMs, RAG pipelines, vector databases, and agent-based systems.We are looking for a hands-on Infrastructure & Operations Engineer to own and build out the operational backbone of our legal case management platform. You will be the go-to person for everything infrastructure "” from development environments to production deployments across on-premises and cloud-hosted client sites.This is a high-impact, high-autonomy role. You will be the primary Ops resource, working alongside developers who currently handle infrastructure part-time. Your mission is to bring structure, reliability, and observability to our operations "” establishing proper CI/CD pipelines, monitoring, alerting, and incident response processes.*This is an on-site role. No remote applicants. This is not negotiable and will not be revisited.**What You Will Do**Build, Release & AI Deployment** Design, build, and maintain CI/CD pipelines using Azure DevOps and Jenkins* Manage build configurations, artifact publishing, and release orchestration* Coordinate deployments across multiple client environments (on-prem and cloud)* Maintain and improve source control workflows using Git*Infrastructure Management** Provision, configure, and maintain Windows Server environments (dev, test, staging, production)* Administer IIS web servers "” application pools, bindings, SSL certificates, performance tuning* Manage SQL Server instances "” installation, configuration, backups, high availability (Always On)* Maintain networking fundamentals "” DNS, firewalls, load balancers, VPN connectivity* Handle patch management and security hardening across all environments*Monitoring, Observability & AI Systems** Stand up and maintain monitoring infrastructure using Zabbix, Grafana, and Loki* Define and implement alerting rules for system health, performance, and availability* Build dashboards that give the team real-time visibility into all environments* Establish baseline metrics and SLAs for system performance*Incident Response & Troubleshooting** Serve as the primary point of contact for production infrastructure issues* Diagnose and resolve system outages, performance degradation, and deployment failures* Conduct root cause analysis and implement preventive measures* Document runbooks and operational procedures for common issues*Security & Compliance** Implement and maintain access controls, following the principle of least privilege* Manage SSL/TLS certificates across all environments* Ensure backup and disaster recovery procedures are in place and regularly tested* Support security audits and maintain awareness of data protection requirements (legal industry handles sensitive PII)*Required Skills & Experience** 5+ years of Windows Server administration "” this is a Windows shop and you must be an expert* Expert-level Microsoft SQL Server "” installation, configuration, backup/restore, performance tuning, Always On availability groups, index maintenance* Expert-level IIS administration "” application pools, URL rewrite, SSL bindings, troubleshooting, performance optimization* CI/CD pipeline experience "” Azure DevOps Pipelines and/or Jenkins, build automation, release management* Scripting with PowerShell "” automation of routine tasks, deployment scripts, system administration* Source control "” Git workflows, branching strategies, merge management* Monitoring tools "” hands-on experience with at least one observability stack (Zabbix, Grafana, Prometheus, or similar)* Networking fundamentals "” DNS, TCP/IP, firewalls, load balancers, VPN, SSL/TLS* Backup & disaster recovery "” designing and testing backup strategies, point-in-time recovery* *LLM Integration*: OpenAI Chat Completions, Assistants API, Realtime API, function calling, streaming. Just knowing the Chat API is not sufficient* *RAG Systems*: Vector databases (Chroma or equivalent), embedding models (HuggingFace/OpenAI), chunking strategies, retrieval pipelines* *Agentic Patterns*: Tool-calling agents, multi-step reasoning, agent orchestration frameworks (LangChain or equivalent)*Preferred Qualifications** Microsoft Certification (MCSA, MCSE, or Azure equivalent) "” strongly preferred* Oracle Cloud Infrastructure (OCI) experience "” compute, networking, storage, block volumes* Grafana + Loki experience for log aggregation and visualization* Zabbix experience for infrastructure monitoring* Python scripting for automation and tooling* Docker / containerization basics* Linux administration fundamentals* AWS EC2 experience* Familiarity with compliance frameworks (SOC 2 or similar)* Experience supporting multi-tenant or client-deployed software products* What Makes You a Great Fit* Ownership mentality "” you will be building this function, not slotting into an existing team. You see gaps and fill them without being asked.* Calm under pressure "” production issues happen. You diagnose methodically, communicate clearly, and fix things fast.* Automation-first mindset "” if you do something twice, you script it. Manual processes are temporary, automation is the goal.* Clear communicator "” you can explain infrastructure issues to developers and stakeholders in plain language. xijylhu * Documentation habit "” you write things down so the team doesn't depend solely on your memory.* Pragmatic problem solver "” you find the right solution for the situation, not the theoretically perfect one.Pay: $100,000.00 - $160,000.00 per yearBenefits:* 401(k)* Dental insurance* Health insurance* Paid time off* Vision insuranceExperience:* Microsoft SQL Server: 3 years (Preferred)* CI/CD: 4 years (Preferred)* PowerShell: 3 years (Preferred)* Disaster recovery: 3 years (Preferred)* Python: 4 years (Preferred)* Microsoft Windows Server: 5 years (Preferred)* AI: 3 years (Preferred)* LLM: 3 years (Preferred)* Agentic AI: 1 year (Preferred)Ability to Commute:* Melville, NY 11747 (Required)Work Location: In person