Vladislav Bubnov

Vladislav Bubnov

DevOps / SRE Engineer

GitHub LinkedIn

๐Ÿ‘‹ About

DevOps Engineer with 7+ years of experience building and operating production infrastructure. I specialize in Kubernetes, CI/CD, Infrastructure as Code, and SRE practices. Experienced in designing highly available architectures, migrating services to containers, building monitoring and centralized logging systems. Implemented DORA metrics to measure engineering performance. Hands-on with cloud providers (Yandex Cloud, Selectel). Actively using LLM tools (ChatGPT, Claude, Cursor) to boost productivity.

๐Ÿ›  Skills

Containers & Orchestration

KubernetesDockerHelmKubespray

IaC & Automation

AnsibleTerraformBashPythonGolang

CI/CD

GitLab CI/CDGitLab RunnerArgoCDwerf

Monitoring & Observability

PrometheusAlertmanagerGrafanaLokiTempoOpenTelemetryELKVectorClickHouse

Databases

PostgreSQLCitusPatroniMongoDBMySQLClickHouseConsul

Web Servers & Proxies

NginxApache

Cloud Platforms

Yandex CloudSelectel

Tools

GitSonarQubeSentryNexusHarborRedashMattermostSlack

SRE Practices

Incident ManagementDORA metricsHigh AvailabilityDisaster RecoveryBackup & Restore

Operating Systems

Linuxadministrationtroubleshootingnetworking

๐Ÿ’ผ Experience

Lead DevOps Engineer

HRBox Mar 2023 โ€” Present ยท Moscow (remote)
  • Designing and maintaining fault-tolerant Kubernetes infrastructure for a multi-tenant SaaS platform (Managed Kubernetes, Selectel)
  • Developing universal Helm charts with dynamic parameterization via values for multiple locations, following DRY principles
  • Launching new locations and data centers: infrastructure provisioning from scratch
  • Driving adoption and implementation of microservices architecture
  • Containerizing services and optimizing Docker images (build from scratch, multistage builds)
  • Building CI/CD pipeline: building and delivering code to dev/stage/prod (GitLab CI, werf, Yandex Container Registry)
  • Managing two distributed PostgreSQL clusters (Citus): troubleshooting, analysis and performance optimization
  • Implementing PostgreSQL backup strategy with PITR: migration from pg_probackup to WAL-G with streaming to S3, reducing backup/restore time and size. Implementing 3-2-1 backup scheme
  • Infrastructure automation with Ansible: writing roles, testing with Molecule
  • Incident response and post-mortem analysis
  • Building monitoring and alerting system from scratch (Prometheus, Grafana, Alertmanager), developing a custom Prometheus exporter for SMS gateway (Python)
  • Centralized logging: implemented Promtail, then migrated to FluentBit โ†’ FluentD โ†’ Loki stack
  • Implementing OpenTelemetry standard: setting up trace collection (Tempo), writing documentation for the development team
  • DDoS protection implementation (CrowdSec)
  • Developing internal tools and scripts (Python, Go, Bash): anomaly detection, troubleshooting automation, tenant migration, YouTrack integration
  • Initiating and coordinating launch of a new product (chat platform): project owner role, deploying LiveKit for video calls, developing Electron desktop client prototype
  • Implementing dynamic dev environments in Kubernetes โ€” 70% faster testing cycles, testers work via browser without local deployment
  • Infrastructure cost optimization: moving stateless services to preemptible nodes while dedicating critical components (PostgreSQL, Redis, message bus, task scheduler) to a dedicated pool โ€” 20% cost reduction
  • Implementing Redash for dev environments: giving developers quick access to test database data without direct cluster connections
  • Implementing ChatOps in Slack: integration with Alertmanager, Grafana, Sentry, YouTrack, GitLab CI, escalations, cron job statuses, prototyping semantic search over message history
  • Hiring and mentoring a DevOps engineer: preparing a full replacement to ensure process continuity
  • Actively using LLM tools to boost productivity (ChatGPT, Claude, Cursor)

DevOps Engineer

Yandex Practicum Mar 2023 โ€” Aug 2024 (1 yr 6 mo) ยท Moscow (remote)
  • Provisioning and maintaining educational infrastructure in Yandex Cloud: VMs, Kubernetes clusters, Nexus (Terraform, Ansible)
  • Building CI/CD pipelines (GitLab CI, ArgoCD)
  • Developing a self-service web application for students (Python, Flask): LDAP authentication, automatic kubeconfig generation, dynamic namespace creation with RBAC on first login โ€” reduced on-call engineer load by 25%
  • Nexus maintenance automation: script for cleaning up stale repositories via API, solving issues with the built-in cleanup mechanism
  • Developing automation tools to reduce routine workload for the team

DevOps Course Mentor

Yandex Practicum Jun 2022 โ€” Mar 2023 (10 mo) ยท Moscow (remote)
  • Code reviews of student assignments, reviewing IaC configurations and CI/CD pipelines
  • Conducting webinars on course topics (Docker, Kubernetes, CI/CD, monitoring)
  • Mentoring and advising students on DevOps practices

Senior DevOps Engineer

Cloud Team Feb 2022 โ€” Mar 2023 (1 yr 2 mo) ยท Kovrov

Outsourcing company, primary client โ€” VseInstrumenty.ru

  • Implementing DORA metrics to measure engineering team effectiveness
  • Participating in SRE team: incident response, conducting post-mortems
  • Working with cloud providers (Yandex Cloud, Selectel): designing and managing cloud infrastructure
  • Ensuring PostgreSQL HA: Patroni + Consul as KV store
  • Participating in logging system migration to Vector + ClickHouse + Redash
  • Developing an on-call SRE dashboard (Python, Flask): one-click traffic switching between data centers โ€” reducing human error and speeding up incident response
  • Onboarding and training new team members

DevOps Engineer

Cloud Team Mar 2021 โ€” Jan 2022 (11 mo) ยท Kovrov

Outsourcing company, primary client โ€” VseInstrumenty.ru

  • Migrating services to Kubernetes: deploying and maintaining two bare-metal clusters (Kubespray)
  • Containerizing services, writing Helm charts, deploying via werf
  • Building monitoring system from scratch (Prometheus, Grafana, Alertmanager)
  • Building centralized logging (ELK/EFK stack)

Senior System Administrator

Eteron LLC Jun 2020 โ€” Feb 2021 (9 mo) ยท Smolensk
  • Promoted after senior employee departure: took over IT infrastructure management and team coordination
  • Maintaining and developing previously implemented systems (Docker, CI/CD, monitoring, logging)
  • Hiring and training a new system administrator, preparing a replacement

Linux System Administrator

IT Grade Sep 2019 โ€” Sep 2020 (1 yr 1 mo) ยท Smolensk
  • Automating standard application server deployments (Bitrix): writing Ansible roles instead of manual configuration
  • Configuring and optimizing web servers (Nginx, Apache)
  • Deploying Elasticsearch, developing a network route monitoring utility (traceroute โ†’ FluentBit โ†’ Elasticsearch)
  • Configuring MySQL replication

Linux System Administrator

Eteron LLC Mar 2019 โ€” Jun 2020 (1 yr 4 mo) ยท Smolensk
  • Building centralized logging system (Elasticsearch, FluentD/FluentBit, Kibana)
  • Migrating infrastructure to Docker
  • Building CI/CD process from scratch
  • Implementing automation and code quality tools (SonarQube, Mattermost)
  • Deploying monitoring system (Prometheus + Alertmanager + Grafana)
  • Automating infrastructure tasks (Ansible, Bash, Python)

System Administrator

Newcontact (now Neovox) Feb 2017 โ€” Dec 2018 (1 yr 11 mo) ยท Smolensk
  • Administering infrastructure for 320+ workstations
  • Managing Active Directory and network infrastructure
  • Administering Windows/Linux servers

Technical Support Specialist

Mannet Jul 2015 โ€” Oct 2016 (1 yr 4 mo) ยท Smolensk

Internet service provider

  • First-line support: receiving and processing requests, diagnostics and troubleshooting, incident response
  • Grew into second-line functions: working with network engineers, using Zabbix
  • Testing new equipment, maintaining and building the knowledge base

๐ŸŽ“ Education

Bachelor's โ€” Applied Computer Science in Economics

RANEPA (Smolensk branch), 2018

๐ŸŽฎ Beyond Work

Outside of work, I keep doing the same things โ€” running a homelab, experimenting with self-hosting and smart home setups, planning to build my own Kubernetes cluster with Thanos. I collect and mod retro gaming consoles. Currently dealing with my midlife crisis by learning to skateboard. I genuinely enjoy tackling complex problems and constantly learning โ€” both on and off the clock.