Page MenuHomePhabricator

taavi (Taavi Väänänen)
UserAdministrator

Projects (29)

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Wednesday

  • Clear sailing ahead.

User Details

User Since
Feb 24 2019, 3:58 PM (279 w, 1 d)
Roles
Administrator
Availability
Busy Busy until Jul 31.
IRC Nick
taavi
LDAP User
Majavah
MediaWiki User
Taavi [ Global Accounts ]

Recent Activity

Today

taavi closed T351450: Migrate Cloud VPS puppet infrastructure to Puppet 7 as Resolved.

which is tracked in T349619: Migrate roles to puppet7 and not here.

Mon, Jul 1, 1:39 PM · Patch-For-Review, cloud-services-team (FY2023/2024-Q3-Q4), Goal, Puppet (Puppet 7.0), Cloud-VPS
aborrero awarded T345294: Move Cloud VPS control plane alerting to alertmanager a Blobhaj token.
Mon, Jul 1, 1:30 PM · cloud-services-team (FY2023/2024-Q3-Q4), Cloud-VPS

Fri, Jun 28

taavi updated taavi.
Fri, Jun 28, 11:57 AM
taavi added a comment to T368717: Add support for locking and unlocking LDAP account to Bitu(-LDAP).

Duplicate of T359820?

Fri, Jun 28, 11:53 AM · Infrastructure-Foundations, Bitu
taavi updated Other Assignee for T367723: Migrate WMCS managed projects to g4 flavors, added: Andrew.
Fri, Jun 28, 10:57 AM · cloud-services-team (FY2023/2024-Q3-Q4), Cloud-VPS
taavi updated Other Assignee for T364457: Migrate eqiad1 hypervisors to Neutron OVS agent, added: Andrew.
Fri, Jun 28, 10:57 AM · Patch-For-Review, cloud-services-team (FY2023/2024-Q3-Q4), Cloud-VPS
taavi awarded T279110: [infra] Replace PodSecurityPolicy in Toolforge Kubernetes a Blobhaj token.
Fri, Jun 28, 10:01 AM · Patch-For-Review, User-aborrero, cloud-services-team, Toolforge
taavi closed T362445: Taavi knowledge transfer: Toolforge k8s upgrades as Resolved.
Fri, Jun 28, 9:17 AM · Toolforge, Cloud-VPS, cloud-services-team
taavi closed T362444: Taavi knowledge transfer: maintain-kubeusers as Resolved.
Fri, Jun 28, 9:17 AM · Toolforge, Cloud-VPS, cloud-services-team
taavi closed T362446: Taavi knowledge transfer: toolforge job investigation as Resolved.
Fri, Jun 28, 9:16 AM · User-dcaro, Toolforge, Cloud-VPS, cloud-services-team
taavi closed T362444: Taavi knowledge transfer: maintain-kubeusers , a subtask of T362443: Learn how to do what Taavi does, as Resolved.
Fri, Jun 28, 9:15 AM · Toolforge, Cloud-VPS, cloud-services-team
taavi closed T362445: Taavi knowledge transfer: Toolforge k8s upgrades, a subtask of T362443: Learn how to do what Taavi does, as Resolved.
Fri, Jun 28, 9:15 AM · Toolforge, Cloud-VPS, cloud-services-team
taavi closed T362448: Taavi knowledge transfer: rebuild toolforge docker images as Resolved.
Fri, Jun 28, 9:15 AM · Toolforge, Cloud-VPS, cloud-services-team
taavi closed T362443: Learn how to do what Taavi does, a subtask of T335978: openstack: consider removing references to old hardware from the database, as Resolved.
Fri, Jun 28, 9:15 AM · cloud-services-team (FY2023/2024-Q3-Q4), Cloud-VPS
taavi closed T362450: Taavi knowledge transfer: Cloud VPS OpenTofu provider as Resolved.
Fri, Jun 28, 9:15 AM · Toolforge, Cloud-VPS, cloud-services-team
taavi closed T362443: Learn how to do what Taavi does as Resolved.
Fri, Jun 28, 9:15 AM · Toolforge, Cloud-VPS, cloud-services-team
taavi closed T362447: Taavi knowledge transfer: Toolforge misc services (e.g. mail server), a subtask of T362443: Learn how to do what Taavi does, as Resolved.
Fri, Jun 28, 9:15 AM · Toolforge, Cloud-VPS, cloud-services-team
taavi awarded T362443: Learn how to do what Taavi does a Blobhaj token.
Fri, Jun 28, 9:15 AM · Toolforge, Cloud-VPS, cloud-services-team
taavi closed T362447: Taavi knowledge transfer: Toolforge misc services (e.g. mail server) as Resolved.
Fri, Jun 28, 9:15 AM · Toolforge, Cloud-VPS, cloud-services-team
taavi closed T362446: Taavi knowledge transfer: toolforge job investigation, a subtask of T362443: Learn how to do what Taavi does, as Resolved.
Fri, Jun 28, 9:14 AM · Toolforge, Cloud-VPS, cloud-services-team
taavi closed T362448: Taavi knowledge transfer: rebuild toolforge docker images, a subtask of T362443: Learn how to do what Taavi does, as Resolved.
Fri, Jun 28, 9:14 AM · Toolforge, Cloud-VPS, cloud-services-team
taavi closed T362449: Taavi knowledge transfer: python-flask-keystone, novaproxy, enc api as Resolved.
Fri, Jun 28, 9:14 AM · Toolforge, Cloud-VPS, cloud-services-team
taavi closed T362449: Taavi knowledge transfer: python-flask-keystone, novaproxy, enc api, a subtask of T362443: Learn how to do what Taavi does, as Resolved.
Fri, Jun 28, 9:14 AM · Toolforge, Cloud-VPS, cloud-services-team
taavi closed T362450: Taavi knowledge transfer: Cloud VPS OpenTofu provider, a subtask of T362443: Learn how to do what Taavi does, as Resolved.
Fri, Jun 28, 9:14 AM · Toolforge, Cloud-VPS, cloud-services-team
taavi closed T362452: Taavi knowledge transfer: cloud-vps monitoring, a subtask of T362443: Learn how to do what Taavi does, as Resolved.
Fri, Jun 28, 9:14 AM · Toolforge, Cloud-VPS, cloud-services-team
taavi closed T362452: Taavi knowledge transfer: cloud-vps monitoring as Resolved.
Fri, Jun 28, 9:14 AM · User-dcaro, Toolforge, Cloud-VPS, cloud-services-team
taavi placed T341338: eqiad1: fix PTR delegations for 185.15.56.0/24 up for grabs.

What's left is removing the old 56.15.185.in-addr.arpa. zone from Designate (while being careful not to remove 0-25.56.15.185.in-addr.arpa.). @Andrew @aborrero can either of you take care of that?

Fri, Jun 28, 9:06 AM · Cloud-VPS, cloud-services-team (FY2023/2024-Q3-Q4), User-aborrero

Thu, Jun 27

taavi closed T311908: Migrate Toolforge Kubernetes hosts to Debian Bullseye or later as Resolved.
Thu, Jun 27, 9:15 PM · cloud-services-team, Kubernetes, Toolforge
taavi closed T311908: Migrate Toolforge Kubernetes hosts to Debian Bullseye or later, a subtask of T311897: Toolforge: migrate to Debian Bullseye or later, as Resolved.
Thu, Jun 27, 9:14 PM · cloud-services-team (FY2023/2024-Q3-Q4), Goal, Cloud-VPS (Debian Buster Deprecation), Toolforge, Epic
taavi created P65538 (An Untitled Masterwork).
Thu, Jun 27, 4:56 PM
taavi closed T368634: Request to increase catalyst project: cores and memory as Resolved.
Thu, Jun 27, 4:54 PM · cloud-services-team, Cloud-VPS (Quota-requests)
taavi created T368599: Archive tech-decision-forum.
Thu, Jun 27, 9:19 AM · tech-decision-forum, Project-Admins

Wed, Jun 26

taavi updated the task description for T367723: Migrate WMCS managed projects to g4 flavors.
Wed, Jun 26, 2:50 PM · cloud-services-team (FY2023/2024-Q3-Q4), Cloud-VPS
taavi created P65477 (An Untitled Masterwork).
Wed, Jun 26, 1:01 PM
taavi reassigned T368426: Migrate codfw1dev hypervisors to Neutron OVS agent from taavi to Andrew.
Wed, Jun 26, 12:58 PM · cloud-services-team, User-aborrero, Cloud-VPS
taavi closed T363631: New upstream release for Pywikibot as Resolved.
Wed, Jun 26, 11:41 AM · Toolforge (Toolforge iteration 11)
taavi claimed T363631: New upstream release for Pywikibot.
Wed, Jun 26, 11:37 AM · Toolforge (Toolforge iteration 11)
taavi closed T337010: cloud vps: fix flavor g3.cores16.ram32.disk20 id 37ed9aaa-35b2-4141-8bc4-272ec8bbc303 as Resolved.

Closing since this is being resolved with the new g4 flavors.

Wed, Jun 26, 10:18 AM · Cloud-VPS
taavi removed a watcher for Cloud-VPS (Debian Buster Deprecation): taavi.
Wed, Jun 26, 9:43 AM
taavi removed a watcher for Toolforge (Quota-requests): taavi.
Wed, Jun 26, 9:42 AM
taavi removed a watcher for Cloud-VPS (Quota-requests): taavi.
Wed, Jun 26, 9:42 AM
taavi removed a watcher for LDAP: taavi.
Wed, Jun 26, 9:40 AM
taavi removed a watcher for Cloud Services Proposals: taavi.
Wed, Jun 26, 9:40 AM
taavi closed T368464: Request quota increase for spacemedia project as Resolved.
Wed, Jun 26, 9:39 AM · cloud-services-team, Tool-spacemedia, Cloud-VPS (Quota-requests)
taavi changed the subtype of T368475: Toolforge fourohfour similar name list should link directly to tools instead of toolsadmin from "Task" to "Feature Request".

which does not have a link to the webservice

Wed, Jun 26, 9:30 AM · Toolforge
taavi added a comment to T367961: envvars-api 0.0.50 depends on unreleased envvars-cli changes.

0.0.50 (or a later version) still needs to be deployed to tools.

Wed, Jun 26, 9:26 AM · Toolforge (Toolforge iteration 11)
taavi added a parent task for T368516: [envvars-api] version 0.0.50 introduces breaking changes that need adapting for replica_cnf service: T367961: envvars-api 0.0.50 depends on unreleased envvars-cli changes.
Wed, Jun 26, 9:26 AM · Patch-For-Review, Toolforge (Toolforge iteration 11)
taavi added a subtask for T367961: envvars-api 0.0.50 depends on unreleased envvars-cli changes: T368516: [envvars-api] version 0.0.50 introduces breaking changes that need adapting for replica_cnf service.
Wed, Jun 26, 9:25 AM · Toolforge (Toolforge iteration 11)
taavi reopened T367961: envvars-api 0.0.50 depends on unreleased envvars-cli changes as "Open".

0.0.50 (or a later version) still needs to be deployed to tools.

Wed, Jun 26, 9:25 AM · Toolforge (Toolforge iteration 11)
taavi closed T368463: `webservice` (build 0.103.8) crashes on login-buster.toolforge.org (python 3.7) as Resolved.

The patch was merged. Given that the local hack is fixing the issue for now and the next tagged version includes the fix, I'm closing this task.

Wed, Jun 26, 9:24 AM · Toolforge
taavi added a comment to T368512: toolforge: maintain-kubeusers crashes if LDAP server terminates session.

Duplicate of T352011: maintain-kubeusers occasionally crashes to a LDAP connection error?

Wed, Jun 26, 8:40 AM · User-aborrero, cloud-services-team, Toolforge
taavi added a comment to T368439: DNS name resolution failure with cdn.esahubble.org from Cloud VPS & Toolforge.

Seemingly this works now:

taavi@tools-bastion-12:~ $ dig cdn.esahubble.org
Wed, Jun 26, 7:54 AM · Tool-spacemedia, Cloud-VPS
taavi removed a project from T368439: DNS name resolution failure with cdn.esahubble.org from Cloud VPS & Toolforge: Toolforge.
Wed, Jun 26, 7:44 AM · Tool-spacemedia, Cloud-VPS
taavi removed a watcher for MediaWiki-extensions-OATHAuth: taavi.
Wed, Jun 26, 5:21 AM

Tue, Jun 25

taavi closed T358761: Deploy OVS test setup in codfw1dev as Resolved.

Testing has been completed. There's still some cloudvirts to migrate but that's happening as a part of the real migration.

Tue, Jun 25, 3:38 PM · cloud-services-team (FY2023/2024-Q3-Q4), User-aborrero, Cloud-VPS
taavi placed T144943: Groups and tools only refreshed at login up for grabs.
Tue, Jun 25, 3:38 PM · Patch-For-Review, Striker
taavi placed T345294: Move Cloud VPS control plane alerting to alertmanager up for grabs.
Tue, Jun 25, 3:38 PM · cloud-services-team (FY2023/2024-Q3-Q4), Cloud-VPS
taavi placed T347683: openstack: create a cookbook to inject commands to VMs via console at scale up for grabs.
Tue, Jun 25, 3:38 PM · SRE-OnFire, Sustainability (Incident Followup), User-dcaro, cloud-services-team, Cloud-VPS
taavi placed T367287: Update Wikitech's LDAP credentials to be read-only up for grabs.
Tue, Jun 25, 3:37 PM · Patch-For-Review, Infrastructure-Foundations, cloud-services-team, LDAP, wikitech.wikimedia.org
taavi closed T358761: Deploy OVS test setup in codfw1dev, a subtask of T326373: Migrate Cloud VPS to Neutron Open vSwitch agent, as Resolved.
Tue, Jun 25, 3:36 PM · Patch-For-Review, cloud-services-team (FY2023/2024-Q3-Q4), User-aborrero, Cloud-VPS
taavi placed T359217: Update Django version used in Striker up for grabs.
Tue, Jun 25, 3:36 PM · Patch-For-Review, Striker
taavi placed T314705: Toolforge: Ensure long-running Kubernetes pods get container updates applied up for grabs.
Tue, Jun 25, 3:36 PM · cloud-services-team, Toolforge
taavi placed T359428: Striker should use ID instead of username to identify SUL accounts up for grabs.
Tue, Jun 25, 3:36 PM · Patch-For-Review, Striker
taavi placed T352886: [jobs-api,jobs-cli] php 8.2 crashes when using XMLReader up for grabs.
Tue, Jun 25, 3:35 PM · cloud-services-team, Toolforge
taavi placed T328502: Move WMCS off of Icinga and introduce alertmanager up for grabs.
Tue, Jun 25, 3:35 PM · cloud-services-team (FY2023/2024-Q3-Q4), Toolforge, Cloud-VPS, Observability-Alerting, Goal
taavi placed T288053: Add external meta-monitoring for metricsinfra up for grabs.
Tue, Jun 25, 3:35 PM · SRE-OnFire, Patch-For-Review, cloud-services-team, Sustainability (Incident Followup), Cloud-VPS
taavi placed T262562: [infra] Fix the mis-named k8s service in tools and toolsbeta projects up for grabs.
Tue, Jun 25, 3:35 PM · cloud-services-team, User-Majavah, Toolforge
taavi updated the task description for T367723: Migrate WMCS managed projects to g4 flavors.
Tue, Jun 25, 12:32 PM · cloud-services-team (FY2023/2024-Q3-Q4), Cloud-VPS
taavi committed rCCKBd18dacf5a976: vps: Add a cookbook to move a floating IP address to an another server.
vps: Add a cookbook to move a floating IP address to an another server
Tue, Jun 25, 11:15 AM
taavi added a comment to T368316: maintain-dbusers.service failing on cloudcontrol1005.

The relevant firewall rule is the wiki-replica-account-creation one defined in cr-labs.yaml. That would need to have an-redacteddb_group added to destination-address to allow this traffic.

Tue, Jun 25, 7:36 AM · Data-Services, cloud-services-team

Mon, Jun 24

taavi edited projects for T368316: maintain-dbusers.service failing on cloudcontrol1005, added: Data-Services; removed Toolforge.
Mon, Jun 24, 8:28 PM · Data-Services, cloud-services-team
taavi closed T367964: Provision more non-NFS k8s workers as Resolved.
Mon, Jun 24, 2:31 PM · Toolforge (Toolforge iteration 11)
taavi added a comment to T367415: Allow Quarry to query its own database.

OTOH exposing the list of users that have logged in to Quarry, even if they've not interacted with anything that leaves a public trace, feels a bit questionable.

Mon, Jun 24, 2:15 PM · cloud-services-team (FY2023/2024-Q3-Q4), Quarry
taavi committed rCCKBbdb013ff2bb4: kubernetes: Handle pods with no ownerReferences.
kubernetes: Handle pods with no ownerReferences
Mon, Jun 24, 1:22 PM
taavi updated subscribers of T368265: Disk volumes of cloud instances are completely mixed-up.

Hi, sorry for that. The servers were rebooted to pick up updated network settings: https://lists.wikimedia.org/hyperkitty/list/[email protected]/message/IYVYMGLPNOU6JON52PV6R6NKX2XHMK6R/

Mon, Jun 24, 1:13 PM · Cloud-VPS, Cloud-Services-Origin-User, affects-Kiwix-and-openZIM
taavi claimed T367964: Provision more non-NFS k8s workers.
Mon, Jun 24, 9:07 AM · Toolforge (Toolforge iteration 11)

Sat, Jun 22

taavi added a comment to T365154: video2commons general failure.

Note that Grafana graph is on your local time zone by default. The video2commons encoder instances were rebooted as a part of https://lists.wikimedia.org/hyperkitty/list/[email protected]/message/IYVYMGLPNOU6JON52PV6R6NKX2XHMK6R/ starting at 13:45 UTC, so them starting to pick up load at about 13:50 UTC matches that very closely.

Sat, Jun 22, 12:43 PM · video2commons

Thu, Jun 20

taavi added a comment to T367220: Archive the OpenStackManager extension.

Yes.

Thu, Jun 20, 8:01 PM · MediaWiki-extensions-OpenStackManager, translatewiki.net, Wikimedia-GitHub, Diffusion-Repository-Administrators, Projects-Cleanup
taavi updated the task description for T364457: Migrate eqiad1 hypervisors to Neutron OVS agent.
Thu, Jun 20, 2:40 PM · Patch-For-Review, cloud-services-team (FY2023/2024-Q3-Q4), Cloud-VPS
taavi placed T300427: Automate maintain-views replica depooling up for grabs.
Thu, Jun 20, 1:58 PM · cloud-services-team (FY2023/2024-Q3-Q4), Data-Platform-SRE, Data-Services
taavi updated the task description for T364457: Migrate eqiad1 hypervisors to Neutron OVS agent.
Thu, Jun 20, 12:38 PM · Patch-For-Review, cloud-services-team (FY2023/2024-Q3-Q4), Cloud-VPS
taavi added a comment to T368007: NodeDown (cloudvirt1063).
-------------------------------------------------------------------------------
Record:      27
Date/Time:   06/20/2024 02:45:43
Source:      system
Severity:    Critical
Description: CPU 2 has a thermal trip (over-temperature) event.
-------------------------------------------------------------------------------
Thu, Jun 20, 12:32 PM · cloud-services-team (Hardware)
taavi updated the task description for T364457: Migrate eqiad1 hypervisors to Neutron OVS agent.
Thu, Jun 20, 11:20 AM · Patch-For-Review, cloud-services-team (FY2023/2024-Q3-Q4), Cloud-VPS
taavi updated the task description for T364457: Migrate eqiad1 hypervisors to Neutron OVS agent.
Thu, Jun 20, 10:36 AM · Patch-For-Review, cloud-services-team (FY2023/2024-Q3-Q4), Cloud-VPS
taavi updated the task description for T351637: add proper dry-run/diff mode to maintain-views.
Thu, Jun 20, 9:14 AM · Data-Services
taavi added a comment to T300427: Automate maintain-views replica depooling.

Killing existing sessions on depooled servers still doesn't work as expected. So what's left is either fixing that functionality on the HAProxy config somehow, or updating maintain-views to have the ability to kill sessions that are holding metadata locks on views that need replacing.

Thu, Jun 20, 9:04 AM · cloud-services-team (FY2023/2024-Q3-Q4), Data-Platform-SRE, Data-Services

Wed, Jun 19

taavi closed T367971: hw troubleshooting: cloudvirt1042, cloudvirt1043 fails to boot after a reimage as Resolved.

The reimages finished succesfully after a firmware upgrade.

Wed, Jun 19, 2:34 PM · SRE, ops-eqiad, cloud-services-team (Hardware), DC-Ops
taavi added a comment to T367971: hw troubleshooting: cloudvirt1042, cloudvirt1043 fails to boot after a reimage.

As suggested by volans I tried running the firmware-upgrade cookbook on the other cumin server which had the correct version cached. That just finished, so I'm trying to reimage cloudvirt1042 with the 21.81 firmware release now.

Wed, Jun 19, 1:50 PM · SRE, ops-eqiad, cloud-services-team (Hardware), DC-Ops
taavi created T367974: firmware-update: spicerack.redfish.RedfishError: iDRAC is not ready. The configuration values cannot be accessed. Please retry after a few minutes..
Wed, Jun 19, 1:47 PM · SRE-tools, Infrastructure-Foundations, SRE
taavi added a project to T367971: hw troubleshooting: cloudvirt1042, cloudvirt1043 fails to boot after a reimage: ops-eqiad.
Wed, Jun 19, 12:55 PM · SRE, ops-eqiad, cloud-services-team (Hardware), DC-Ops
taavi added a comment to T367971: hw troubleshooting: cloudvirt1042, cloudvirt1043 fails to boot after a reimage.

cloudvirt1043 seems to be having the same issue too. So this may be an issue for the entire batch.

Wed, Jun 19, 12:55 PM · SRE, ops-eqiad, cloud-services-team (Hardware), DC-Ops
taavi renamed T367971: hw troubleshooting: cloudvirt1042, cloudvirt1043 fails to boot after a reimage from hw troubleshooting: cloudvirt1042 fails to boot after a reimage to hw troubleshooting: cloudvirt1042, cloudvirt1043 fails to boot after a reimage.
Wed, Jun 19, 12:53 PM · SRE, ops-eqiad, cloud-services-team (Hardware), DC-Ops
taavi created T367971: hw troubleshooting: cloudvirt1042, cloudvirt1043 fails to boot after a reimage.
Wed, Jun 19, 12:52 PM · SRE, ops-eqiad, cloud-services-team (Hardware), DC-Ops
taavi closed T367967: Requesting content administrator access for Kamila Součková as Resolved.
Wed, Jun 19, 12:30 PM · wikitech.wikimedia.org
taavi edited projects for T367964: Provision more non-NFS k8s workers, added: Toolforge; removed Toolforge (Toolforge iteration 11).
Wed, Jun 19, 11:55 AM · Toolforge (Toolforge iteration 11)
taavi triaged T367964: Provision more non-NFS k8s workers as Medium priority.
Wed, Jun 19, 11:53 AM · Toolforge (Toolforge iteration 11)
taavi added a comment to T355663: Allocate more available UNIX UIDs for human users.

Currently the highest number in use is 47058. So that's 1081 accounts in the 148 days since I created this task, or about 7.3 accounts per day. Assuming a similar rate of growth we're looking at running out of numbers in about 400 days, which would be late July next calendar year.

Wed, Jun 19, 11:22 AM · User-MoritzMuehlenhoff, Bitu, Infrastructure-Foundations, cloud-services-team, LDAP
taavi triaged T367961: envvars-api 0.0.50 depends on unreleased envvars-cli changes as High priority.
Wed, Jun 19, 10:36 AM · Toolforge (Toolforge iteration 11)
taavi created T367961: envvars-api 0.0.50 depends on unreleased envvars-cli changes.
Wed, Jun 19, 10:36 AM · Toolforge (Toolforge iteration 11)
taavi added a comment to T367956: haproxy: install some command line interface.

fwiw, I tend to just port-forward the stats interface on port 8404 to my laptop.

Wed, Jun 19, 9:50 AM · User-aborrero, cloud-services-team