DevOps Engineer with 7+ years of experience building and operating production infrastructure. I specialize in Kubernetes, CI/CD, Infrastructure as Code, and SRE practices. Experienced in designing highly available architectures, migrating services to containers, and building monitoring and centralized logging systems. Implemented DORA metrics to measure engineering performance. Hands-on with cloud providers (Yandex Cloud, Selectel). Actively using LLM tools (ChatGPT, Claude, Cursor) to boost productivity.
Designing and maintaining fault-tolerant Kubernetes infrastructure for a multi-tenant SaaS platform (Managed Kubernetes, Selectel)
Developing universal Helm charts with dynamic parameterization via values for multiple locations, following DRY principles
Launching new locations and data centers: infrastructure provisioning from scratch
Driving adoption and implementation of microservices architecture
Containerizing services and optimizing Docker images (build from scratch, multistage builds)
Building a CI/CD pipeline that builds and delivers code to dev/stage/prod (GitLab CI, werf, Yandex Container Registry)
Managing two distributed PostgreSQL clusters (Citus): troubleshooting, analysis and performance optimization
Implementing a PostgreSQL backup strategy with PITR: migration from pg_probackup to WAL-G with streaming to S3, reducing backup/restore time and backup size; implementing the 3-2-1 backup scheme
Infrastructure automation with Ansible: writing roles, testing with Molecule
Incident response and post-mortem analysis
Building a monitoring and alerting system from scratch (Prometheus, Grafana, Alertmanager), developing a custom Prometheus exporter for an SMS gateway (Python)
Centralized logging: implemented Promtail, then migrated to FluentBit → FluentD → Loki stack
Implementing OpenTelemetry standard: setting up trace collection (Tempo), writing documentation for the development team
Initiating and coordinating launch of a new product (chat platform): project owner role, deploying LiveKit for video calls, developing Electron desktop client prototype
Implementing dynamic dev environments in Kubernetes → 70% faster testing cycles; testers work via browser without local deployment
Infrastructure cost optimization: moving stateless services to preemptible nodes while keeping critical components (PostgreSQL, Redis, message bus, task scheduler) on a dedicated pool → 20% cost reduction
Implementing Redash for dev environments: giving developers quick access to test database data without direct cluster connections
Implementing ChatOps in Slack: integration with Alertmanager, Grafana, Sentry, YouTrack, GitLab CI, escalations, cron job statuses, prototyping semantic search over message history
Hiring and mentoring a DevOps engineer: preparing a full replacement to ensure process continuity
Actively using LLM tools to boost productivity (ChatGPT, Claude, Cursor)
DevOps Engineer
Yandex Practicum
Mar 2023 – Aug 2024 (1 yr 6 mo) · Moscow (remote)
Provisioning and maintaining educational infrastructure in Yandex Cloud: VMs, Kubernetes clusters, Nexus (Terraform, Ansible)
Building CI/CD pipelines (GitLab CI, ArgoCD)
Developing a self-service web application for students (Python, Flask): LDAP authentication, automatic kubeconfig generation, dynamic namespace creation with RBAC on first login → reduced on-call engineer load by 25%
Nexus maintenance automation: script for cleaning up stale repositories via API, solving issues with the built-in cleanup mechanism
Developing automation tools to reduce routine workload for the team
Implementing DORA metrics to measure engineering team effectiveness
Participating in SRE team: incident response, conducting post-mortems
Working with cloud providers (Yandex Cloud, Selectel): designing and managing cloud infrastructure
Ensuring PostgreSQL HA: Patroni + Consul as KV store
Participating in logging system migration to Vector + ClickHouse + Redash
Developing an on-call SRE dashboard (Python, Flask): one-click traffic switching between data centers, reducing human error and speeding up incident response
Administering infrastructure for 320+ workstations
Managing Active Directory and network infrastructure
Administering Windows/Linux servers
Technical Support Specialist
Mannet
Jul 2015 – Oct 2016 (1 yr 4 mo) · Smolensk
Internet service provider
First-line support: receiving and processing requests, diagnostics and troubleshooting, incident response
Grew into second-line functions: working with network engineers, using Zabbix
Testing new equipment, maintaining and building the knowledge base
🎓 Education
Bachelor's degree, Applied Computer Science in Economics
RANEPA (Smolensk branch), 2018
🎮 Beyond Work
Outside of work, I keep doing the same things: running a homelab, experimenting with self-hosting and smart home setups, and planning to build my own Kubernetes cluster with Thanos. I collect and mod retro gaming consoles. Currently dealing with my midlife crisis by learning to skateboard. I genuinely enjoy tackling complex problems and constantly learning, both on and off the clock.