iBet uBet web content aggregator. Adding the entire web to your favor.
iBet uBet web content aggregator. Adding the entire web to your favor.



Link to original content: http://phabricator.wikimedia.org/p/eoghan/
♟ eoghan
Page MenuHomePhabricator

eoghan
User

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Saturday

  • Clear sailing ahead.

User Details

User Since
Jan 23 2023, 12:05 PM (96 w, 3 d)
Availability
Available
LDAP User
EoghanGaffney
MediaWiki User
EGaffney-WMF [ Global Accounts ]

Recent Activity

Yesterday

Dzahn awarded T377045: Message content lost when mailing list is the only recipient a Party Time token.
Wed, Nov 27, 1:39 AM · collaboration-services, SRE, Wikimedia-Mailing-lists

Tue, Nov 26

eoghan closed T347004: Create a staging apt repository for CI-based builds of Debian packages as Resolved.

The pipeline seems to be working correctly, and we've got documentation in place. I think we can close this out as completed!

Tue, Nov 26, 1:38 PM · collaboration-services, GitLab (CI & Job Runners), serviceops
eoghan closed T347004: Create a staging apt repository for CI-based builds of Debian packages, a subtask of T304491: Standardize Debian package builds on GitLab CI, as Resolved.
Tue, Nov 26, 1:37 PM · collaboration-services, GitLab (CI & Job Runners), serviceops

Mon, Nov 25

eoghan added a comment to T377045: Message content lost when mailing list is the only recipient.

We've been doing some investigating over the last week, and it's a very hard problem to track down. No useful information is seen in the logs, and there's no exceptions or anything that would indicate what's been happening.

Mon, Nov 25, 11:26 AM · collaboration-services, SRE, Wikimedia-Mailing-lists

Fri, Nov 22

eoghan added a comment to T380396: spamassassin broken for VRTS.

I see what's going on. There are three checks provided by VALIDITY configured by default. RCVD_IN_VALIDITY_CERTIFIED, RCVD_IN_VALIDITY_SAFE, and RCVD_IN_VALIDITY_RPBL. We disabled the first one but not the other two, which is why we saw a small but measurable decrease in false clean messages, and why we're still seeing the other VALIDITY checks in spam messages. These can be seen configured here. I also checked that they were responding to all queries with the same exceeded error message.

Fri, Nov 22, 7:09 PM · collaboration-services, Infrastructure-Foundations, Mail, vrts, Znuny

Thu, Nov 21

eoghan added a comment to T380396: spamassassin broken for VRTS.

I've put in a change to disable this specific check, we're also going to look at whether we can sign up for an account with them to get a higher usage limit.

Thu, Nov 21, 8:39 PM · collaboration-services, Infrastructure-Foundations, Mail, vrts, Znuny

Tue, Nov 19

Ruthven awarded T380009: VRTS e-mail address unreachable / e-mail routing issue a Cookie token.
Tue, Nov 19, 1:33 PM · Patch-For-Review, collaboration-services, User-revi, Infrastructure-Foundations, Mail, SRE, Znuny, vrts

Mon, Nov 18

eoghan added a comment to T380009: VRTS e-mail address unreachable / e-mail routing issue.

@jhathaway It was a rule set up to change the envelope-to of a mail from a given source. When we disabled the rule, gmail started returning 550s for any address unknown in the wm.o domain, but when the rule was re-enabled, it was back to 250/ok for anything unknown. When we set the "Account types to affect" not to include the catch-all, it started returning 550s again. It's not clear whether leaving the catch-all unchecked is desirable behaviour on the ITS side, waiting to hear back on that. I'm sure there'll be a way to work around it if we have to.

Mon, Nov 18, 9:17 PM · Patch-For-Review, collaboration-services, User-revi, Infrastructure-Foundations, Mail, SRE, Znuny, vrts
eoghan added a comment to T380009: VRTS e-mail address unreachable / e-mail routing issue.

We had a quick chat with ITS today where they disabled the change that caused the routing to change, and it did cause gmail to start returning 550 for unknown addresses again, so we have confirmed their change was what caused this to start behaving differently.

Mon, Nov 18, 6:09 PM · Patch-For-Review, collaboration-services, User-revi, Infrastructure-Foundations, Mail, SRE, Znuny, vrts
eoghan claimed T380009: VRTS e-mail address unreachable / e-mail routing issue.
Mon, Nov 18, 10:17 AM · Patch-For-Review, collaboration-services, User-revi, Infrastructure-Foundations, Mail, SRE, Znuny, vrts

Fri, Nov 15

eoghan lowered the priority of T380009: VRTS e-mail address unreachable / e-mail routing issue from Unbreak Now! to High.

We've made a change to the aliases routing script which we believe has fixed the problem. I've verified that mail is delivering to vrts now, and we've seen two of our test tickets arrive.

Fri, Nov 15, 11:56 AM · Patch-For-Review, collaboration-services, User-revi, Infrastructure-Foundations, Mail, SRE, Znuny, vrts
eoghan added a comment to T380009: VRTS e-mail address unreachable / e-mail routing issue.

So the issue is coming from the vrts_aliases.py cron job. Something has changed in how gmail is responding to emails here and claiming they're valid. I'm going to try see if we can change that to ignore the gmail check

Fri, Nov 15, 11:12 AM · Patch-For-Review, collaboration-services, User-revi, Infrastructure-Foundations, Mail, SRE, Znuny, vrts

Tue, Nov 5

eoghan added a comment to T377045: Message content lost when mailing list is the only recipient.

The bug for multiple mailing lists was fixed several years ago: https://gitlab.com/mailman/mailman/-/issues/955 (so, hopefully, the fix is included on our mailman version)

Tue, Nov 5, 10:53 PM · collaboration-services, SRE, Wikimedia-Mailing-lists

Oct 24 2024

eoghan added a comment to T372586: [vrts] Investigate slow loading search modal.

We've been talking about this back and forth with znuny, they have a few suggestions, tracking here to see what we can try:

Oct 24 2024, 2:39 PM · Znuny, collaboration-services

Oct 21 2024

eoghan added a comment to T377643: SystemdUnitFailed - phab2002 - phabricator_task_dump / fix mysql grants for phab2002 after IP change.

Silenced these alerts for 48 hours.

Oct 21 2024, 10:45 AM · collaboration-services

Oct 4 2024

eoghan closed T375735: lists1004 interfaces file contains extra lines as Resolved.

The file is cleaned up, closing.

Oct 4 2024, 2:23 PM · collaboration-services

Oct 3 2024

brennen awarded T356077: Inbound mail to phabricator doesn't work a Unicorn! token.
Oct 3 2024, 6:18 PM · User-notice-archive, Python3-Porting, Release-Engineering-Team (Priority Backlog 📥), User-brennen, Phabricator, collaboration-services
valerio.bozzolan awarded T356077: Inbound mail to phabricator doesn't work a Fox token.
Oct 3 2024, 6:17 PM · User-notice-archive, Python3-Porting, Release-Engineering-Team (Priority Backlog 📥), User-brennen, Phabricator, collaboration-services

Oct 1 2024

eoghan added a comment to T375735: lists1004 interfaces file contains extra lines.

I commented out the extra lines and rebooted, the host came back up and mail is flowing as expected.

Oct 1 2024, 10:48 AM · collaboration-services

Sep 26 2024

eoghan added a comment to T375735: lists1004 interfaces file contains extra lines.

208.80.154.21/32 and 2620:0:861:1:208:80:154:21/128 are both from the linked puppet change, but the third address, 2620:0:861:1:208:80:154:81/64, isn't. This was generated somewhere else.

Sep 26 2024, 12:02 PM · collaboration-services
eoghan updated the task description for T375735: lists1004 interfaces file contains extra lines.
Sep 26 2024, 11:54 AM · collaboration-services
eoghan created T375735: lists1004 interfaces file contains extra lines.
Sep 26 2024, 11:54 AM · collaboration-services

Sep 24 2024

eoghan updated the task description for T370677: migrate all sre-collab services to nftables.
Sep 24 2024, 5:00 PM · Patch-For-Review, collaboration-services

Sep 17 2024

eoghan closed T373846: Move mailman2 data (/var/lib/mailman) to separate partition as Resolved.

This is no longer an issue

Sep 17 2024, 10:59 AM · collaboration-services

Sep 9 2024

eoghan closed T374320: SystemdUnitFailed (lists2001) as Resolved.

This will mask the service from starting inadvertently, and should stop these alarms.

Sep 9 2024, 2:39 PM · collaboration-services

Sep 6 2024

eoghan closed T374186: SystemdUnitFailed - lists2001 - mailman3 as Resolved.

This was an after-effect of rebooting the host.

Sep 6 2024, 3:49 PM · collaboration-services
eoghan closed T374194: SystemdUnitFailed (lists2001) as Resolved.

This was due to the other list host rebooting.

Sep 6 2024, 2:19 PM · collaboration-services

Sep 5 2024

eoghan renamed T374147: SystemdUnitFailed - mailman3 - lists2001 - gitlab1004 from SystemdUnitFailed (lists2001) to SystemdUnitFailed.
Sep 5 2024, 10:08 PM · collaboration-services
eoghan closed T374096: SystemdUnitFailed (lists2001) as Resolved.

This was me when restarting the host related to T373980

Sep 5 2024, 10:03 PM · collaboration-services
eoghan renamed T374147: SystemdUnitFailed - mailman3 - lists2001 - gitlab1004 from SystemdUnitFailed to SystemdUnitFailed (lists2001).
Sep 5 2024, 10:02 PM · collaboration-services
eoghan added a comment to T373980: Hosts using nftables are not reachable via ssh from alert[12]002. Reboot needed..

Yeah, that's right -- we moved from ferm to nftables, but then reverted because of T373637. I'll take a look at the cleanup later this afternoon.

Sep 5 2024, 3:43 PM · collaboration-services, Infrastructure-Foundations, SRE Observability (FY2024/2025-Q1), Observability-Alerting
eoghan updated subscribers of T373846: Move mailman2 data (/var/lib/mailman) to separate partition.

@fgiunchedi pointed out that there was still some space free in the VG, so the volume could be expanded instead. There's approximately 90G free on the disk unallocated, and since the mailman2 data will never grow (the newest file in that directory is 2001), this should give us sufficient headroom for logrotate to take care of the rest.

Sep 5 2024, 2:30 PM · collaboration-services
eoghan updated the task description for T373980: Hosts using nftables are not reachable via ssh from alert[12]002. Reboot needed..
Sep 5 2024, 12:35 PM · collaboration-services, Infrastructure-Foundations, SRE Observability (FY2024/2025-Q1), Observability-Alerting

Sep 3 2024

eoghan claimed T373846: Move mailman2 data (/var/lib/mailman) to separate partition.
Sep 3 2024, 9:17 AM · collaboration-services
eoghan created T373846: Move mailman2 data (/var/lib/mailman) to separate partition.
Sep 3 2024, 9:17 AM · collaboration-services

Aug 23 2024

eoghan closed T371222: Backups are failing on the GitLab test instance as Resolved.

I've changed the backup script to tolerate a failure in the prometheus-pushgateway, so this can be closed.

Aug 23 2024, 10:51 AM · Patch-For-Review, GitLab, collaboration-services

Aug 22 2024

eoghan closed T369017: SystemdUnitFailed - lists1004 - wmf_auto_restart_exim4 as Resolved.

This seems to have been a blip that hasn't reoccurred.

Aug 22 2024, 1:32 PM · collaboration-services

Aug 16 2024

eoghan closed T372369: Requesting access to ldap/wmf for divec as Resolved.

I've added the user to the wmf group. @dchan, I'm going to close this now, let me know if anything seems missing!

Aug 16 2024, 2:48 PM · SRE, SRE-Access-Requests
eoghan claimed T372369: Requesting access to ldap/wmf for divec.
Aug 16 2024, 9:55 AM · SRE, SRE-Access-Requests
eoghan reassigned T371796: Requesting access to <analytics-privatedata-users> for ifeatu_nnaobi_wmde from eoghan to odimitrijevic.

Hi @odimitrijevic, could you please look at this as an approver for the analytics-privatedata-users group? Thanks!

Aug 16 2024, 9:50 AM · Patch-For-Review, Data-Engineering, SRE, SRE-Access-Requests
eoghan updated the task description for T372369: Requesting access to ldap/wmf for divec.
Aug 16 2024, 8:40 AM · SRE, SRE-Access-Requests

Aug 15 2024

eoghan closed T372290: Grant Access to wmf for SaraiSan WMF as Resolved.

Confirmed working!

Aug 15 2024, 10:49 AM · SRE, LDAP-Access-Requests
eoghan claimed T371796: Requesting access to <analytics-privatedata-users> for ifeatu_nnaobi_wmde.
Aug 15 2024, 9:09 AM · Patch-For-Review, Data-Engineering, SRE, SRE-Access-Requests
eoghan assigned T371796: Requesting access to <analytics-privatedata-users> for ifeatu_nnaobi_wmde to Ifeatu_Nnaobi_WMDE.
Aug 15 2024, 9:08 AM · Patch-For-Review, Data-Engineering, SRE, SRE-Access-Requests

Aug 13 2024

eoghan claimed T372290: Grant Access to wmf for SaraiSan WMF.

Confirmed that the account was all correct as per the wiki , and have added the user to data.yaml, the WMF-NDA phabricator group, and the wmf LDAP group.

Aug 13 2024, 4:57 PM · SRE, LDAP-Access-Requests
eoghan added a member for WMF-NDA: Sarai-WMF.
Aug 13 2024, 4:50 PM

Jul 25 2024

eoghan updated the task description for T370973: GitLab Security Release 17.2.1, 17.1.3, 17.0.5.
Jul 25 2024, 10:13 PM · Vuln-VulnComponent, SecTeam-Processed, collaboration-services, GitLab, Security
eoghan updated the task description for T370973: GitLab Security Release 17.2.1, 17.1.3, 17.0.5.
Jul 25 2024, 10:01 PM · Vuln-VulnComponent, SecTeam-Processed, collaboration-services, GitLab, Security

Jul 9 2024

eoghan closed T331706: Migrate Mailman/lists to Bullseye/Bookworm as Resolved.

lists1001 has been decommissioned and all current hosts are running bookworm.

Jul 9 2024, 3:44 PM · Patch-For-Review, collaboration-services, Wikimedia-Mailing-lists, SRE
eoghan closed T331706: Migrate Mailman/lists to Bullseye/Bookworm, a subtask of T327068: Bullseye upgrade for remaining Collab hosts, as Resolved.
Jul 9 2024, 3:41 PM · collaboration-services
eoghan closed T367959: Mailman PTR records as Resolved.

This was mostly a brain-dump just before I left on PTO, so we're going to close this in favour of some of the other more detailed tasks, namely T278495: Figure out plan for mailman IP situation and T286066: Put lists.wikimedia.org web interface behind LVS

Jul 9 2024, 2:55 PM · collaboration-services

Jul 4 2024

eoghan closed T336555: mailman3 discard_held_messages systemd script apparently failing since 2023-03-26 as Resolved.

lists1003 doesn't exist anymore, so this can probably be closed.

Jul 4 2024, 2:16 PM · Wikimedia-Mailing-lists, SRE

Jul 2 2024

eoghan reassigned T367833: Update grants for mailman from eoghan to Ladsgroup.

Spoken with @Ladsgroup , I think there's nothing immediate for sre-collab to do here so reassigning. Feel free to send it back to me if that changes!

Jul 2 2024, 9:06 PM · DBA, collaboration-services, SRE
eoghan closed T283615: Make mailman3 work in the standby host (lists2001.wikimedia.org) as Resolved.

I think we can close this, since the puppet module now installs mailman3 on lists2001 (albeit disabled), unless I'm missing something

Jul 2 2024, 12:22 PM · collaboration-services, SRE, Datacenter-Switchover, Wikimedia-Mailing-lists
eoghan closed T283615: Make mailman3 work in the standby host (lists2001.wikimedia.org), a subtask of T331706: Migrate Mailman/lists to Bullseye/Bookworm, as Resolved.
Jul 2 2024, 12:22 PM · Patch-For-Review, collaboration-services, Wikimedia-Mailing-lists, SRE
eoghan added a comment to T331706: Migrate Mailman/lists to Bullseye/Bookworm.

lists1001 has been powered off, it will stay off for 1 week and then I'll decommission it fully on Tuesday, 9th July, after this we can close this ticket.

Jul 2 2024, 12:21 PM · Patch-For-Review, collaboration-services, Wikimedia-Mailing-lists, SRE
eoghan closed T368063: lists: Exim paniclog entries about Tainted filename for search: '/etc/exim4/aliases/lists.wikimedia.org' as Resolved.

This was fixed by the patch merged on June 20th.

Jul 2 2024, 9:53 AM · collaboration-services
eoghan closed T368682: PuppetDisabled - lists2001 as Resolved.
Jul 2 2024, 8:40 AM · collaboration-services
eoghan added a comment to T368682: PuppetDisabled - lists2001.

Puppet has been re-enabled and run successfully, sorry for the noise

Jul 2 2024, 8:40 AM · collaboration-services
eoghan merged task T369007: PuppetDisabled - lists2001 into T368682: PuppetDisabled - lists2001.
Jul 2 2024, 8:39 AM · collaboration-services
eoghan merged T369007: PuppetDisabled - lists2001 into T368682: PuppetDisabled - lists2001.
Jul 2 2024, 8:39 AM · collaboration-services

Jun 21 2024

eoghan added a comment to T331706: Migrate Mailman/lists to Bullseye/Bookworm.

The migration to the new host is done. The last remaining item before we can close this ticket is to decommission the old host. We're going to keep that around for two weeks after the migration, which will be Tuesday 2nd July. The host will be shut down on that date, and decommissioned on the Tuesday after.

Jun 21 2024, 7:10 AM · Patch-For-Review, collaboration-services, Wikimedia-Mailing-lists, SRE

Jun 20 2024

Dzahn awarded T367975: SystemdUnitFailed - lists2001 - rsync-mailman-root-sync a Like token.
Jun 20 2024, 5:38 PM · collaboration-services
eoghan closed T367975: SystemdUnitFailed - lists2001 - rsync-mailman-root-sync as Resolved.

The patch for this was merged and should no longer be an issue.

Jun 20 2024, 3:36 PM · collaboration-services
eoghan created T368063: lists: Exim paniclog entries about Tainted filename for search: '/etc/exim4/aliases/lists.wikimedia.org'.
Jun 20 2024, 2:54 PM · collaboration-services
eoghan closed T367627: SystemdUnitFailed - lists2001 - rsync-mailman3-root as Resolved.
Jun 20 2024, 12:01 PM · collaboration-services
eoghan closed T367874: SystemdUnitFailed - lists2001 - rsync-mailman-root-sync as Resolved.

Yep!

Jun 20 2024, 10:51 AM · collaboration-services
eoghan merged T368015: SystemdUnitFailed - lists2001 - rsync-mailman-root-sync into T367975: SystemdUnitFailed - lists2001 - rsync-mailman-root-sync.
Jun 20 2024, 10:10 AM · collaboration-services
eoghan merged task T368015: SystemdUnitFailed - lists2001 - rsync-mailman-root-sync into T367975: SystemdUnitFailed - lists2001 - rsync-mailman-root-sync.
Jun 20 2024, 10:10 AM · collaboration-services

Jun 19 2024

eoghan updated the task description for T367959: Mailman PTR records.
Jun 19 2024, 10:02 AM · collaboration-services
eoghan created T367959: Mailman PTR records.
Jun 19 2024, 9:59 AM · collaboration-services
eoghan closed T367521: Mailman Downtime: Migrate mailman from lists1001 to lists1004 as Resolved.

The maintenance was completed yesterday and so far the service seems stable. I'm going to close this now, and we can re-open if we come across any issues.

Jun 19 2024, 9:10 AM · User-notice-archive, Patch-For-Review, collaboration-services, Wikimedia-Mailing-lists, SRE
eoghan closed T367521: Mailman Downtime: Migrate mailman from lists1001 to lists1004, a subtask of T331706: Migrate Mailman/lists to Bullseye/Bookworm, as Resolved.
Jun 19 2024, 9:07 AM · Patch-For-Review, collaboration-services, Wikimedia-Mailing-lists, SRE

Jun 18 2024

eoghan updated the task description for T367521: Mailman Downtime: Migrate mailman from lists1001 to lists1004.
Jun 18 2024, 3:32 PM · User-notice-archive, Patch-For-Review, collaboration-services, Wikimedia-Mailing-lists, SRE
Dzahn awarded T331706: Migrate Mailman/lists to Bullseye/Bookworm a Orange Medal token.
Jun 18 2024, 3:29 PM · Patch-For-Review, collaboration-services, Wikimedia-Mailing-lists, SRE
eoghan updated the task description for T367521: Mailman Downtime: Migrate mailman from lists1001 to lists1004.
Jun 18 2024, 12:42 PM · User-notice-archive, Patch-For-Review, collaboration-services, Wikimedia-Mailing-lists, SRE
eoghan updated the task description for T367521: Mailman Downtime: Migrate mailman from lists1001 to lists1004.
Jun 18 2024, 12:42 PM · User-notice-archive, Patch-For-Review, collaboration-services, Wikimedia-Mailing-lists, SRE
eoghan updated the task description for T367521: Mailman Downtime: Migrate mailman from lists1001 to lists1004.
Jun 18 2024, 10:47 AM · User-notice-archive, Patch-For-Review, collaboration-services, Wikimedia-Mailing-lists, SRE
eoghan updated the task description for T367521: Mailman Downtime: Migrate mailman from lists1001 to lists1004.
Jun 18 2024, 10:18 AM · User-notice-archive, Patch-For-Review, collaboration-services, Wikimedia-Mailing-lists, SRE
eoghan updated the task description for T367521: Mailman Downtime: Migrate mailman from lists1001 to lists1004.
Jun 18 2024, 9:04 AM · User-notice-archive, Patch-For-Review, collaboration-services, Wikimedia-Mailing-lists, SRE
eoghan added a comment to T367833: Update grants for mailman.

That's right -- we'll be doing that as part of the maintenance work later today. We kept them firewalled off so that the non-active host isn't writing to the database at the same time as the active. In the future it might make more sense to allow all hosts access but have a read/write user for the active host, and read only for the non-active.

Jun 18 2024, 7:47 AM · DBA, collaboration-services, SRE

Jun 17 2024

eoghan added a comment to T367833: Update grants for mailman.

It's possible that the grants are already covered by the proxies listed here, but it would be good to check before we start our migration

Jun 17 2024, 10:59 PM · DBA, collaboration-services, SRE
eoghan reassigned T367833: Update grants for mailman from eoghan to Ladsgroup.
Jun 17 2024, 10:57 PM · DBA, collaboration-services, SRE
eoghan created T367833: Update grants for mailman.
Jun 17 2024, 10:55 PM · DBA, collaboration-services, SRE
eoghan updated the task description for T367521: Mailman Downtime: Migrate mailman from lists1001 to lists1004.
Jun 17 2024, 10:30 PM · User-notice-archive, Patch-For-Review, collaboration-services, Wikimedia-Mailing-lists, SRE

Jun 14 2024

Quiddity awarded T367521: Mailman Downtime: Migrate mailman from lists1001 to lists1004 a Love token.
Jun 14 2024, 5:24 PM · User-notice-archive, Patch-For-Review, collaboration-services, Wikimedia-Mailing-lists, SRE
eoghan moved T367521: Mailman Downtime: Migrate mailman from lists1001 to lists1004 from Incoming to Work in Progress on the collaboration-services board.
Jun 14 2024, 3:22 PM · User-notice-archive, Patch-For-Review, collaboration-services, Wikimedia-Mailing-lists, SRE
eoghan triaged T367521: Mailman Downtime: Migrate mailman from lists1001 to lists1004 as High priority.
Jun 14 2024, 3:22 PM · User-notice-archive, Patch-For-Review, collaboration-services, Wikimedia-Mailing-lists, SRE
eoghan updated the task description for T367521: Mailman Downtime: Migrate mailman from lists1001 to lists1004.
Jun 14 2024, 2:56 PM · User-notice-archive, Patch-For-Review, collaboration-services, Wikimedia-Mailing-lists, SRE
eoghan added a comment to T331706: Migrate Mailman/lists to Bullseye/Bookworm.

I've created a sub-task for the migration itself so users and community members can follow the migration itself more easily, rather than trawling through comments and patch notifications. It's been tagged with User-notice so it ends up on tech news. The downtime will be on Tuesday 18th from 10-12 UTC.

Jun 14 2024, 2:48 PM · Patch-For-Review, collaboration-services, Wikimedia-Mailing-lists, SRE
eoghan created T367521: Mailman Downtime: Migrate mailman from lists1001 to lists1004.
Jun 14 2024, 2:44 PM · User-notice-archive, Patch-For-Review, collaboration-services, Wikimedia-Mailing-lists, SRE
eoghan added a comment to T331706: Migrate Mailman/lists to Bullseye/Bookworm.

Overall looks good. Just noting that rebuilding index will take a very long time and that can make the downtime quite longer. I wonder of we can just rsync the indexes and avoid that? We probably can also run rebuild index after migration (and note to people that search won't work for a while)

Jun 14 2024, 2:30 PM · Patch-For-Review, collaboration-services, Wikimedia-Mailing-lists, SRE
eoghan closed T367469: SystemdUnitFailed - rsync-mailman3 as Resolved.
Jun 14 2024, 2:21 PM · collaboration-services
eoghan closed T366927: SystemdUnitFailed - lists1004 - mailman3 as Resolved.
Jun 14 2024, 2:14 PM · collaboration-services
eoghan closed T366899: PuppetFailure - lists1004 as Resolved.
Jun 14 2024, 2:13 PM · collaboration-services

Jun 10 2024

eoghan closed T366894: SystemdUnitFailed - lists1004 - mailman3 as Resolved.

This was due to the apt package starting the service (and failing) despite the puppet recipe being set to ensure => stopped

Jun 10 2024, 7:25 PM · collaboration-services

Jun 6 2024

eoghan added a comment to T331706: Migrate Mailman/lists to Bullseye/Bookworm.

The rough outline for migration is:

Jun 6 2024, 3:56 PM · Patch-For-Review, collaboration-services, Wikimedia-Mailing-lists, SRE

May 31 2024

Dzahn awarded T365768: Upgrade Gitlab embedded version of Postgres a Doubloon token.
May 31 2024, 6:52 PM · collaboration-services

May 30 2024

eoghan updated Other Assignee for T365768: Upgrade Gitlab embedded version of Postgres, added: Jelto.

I've also run the upgrade on gitlab1004, now only leaving the primary (gitlab2002) left to be upgraded.

May 30 2024, 10:38 AM · collaboration-services

May 28 2024

eoghan added a comment to T365768: Upgrade Gitlab embedded version of Postgres.

I ran the test upgrade (sudo gitlab-ctl pg-upgrade) this afternoon on gitlab1003, and it succeeded. The total time required was 1m51s, and generated no error messages.

May 28 2024, 4:10 PM · collaboration-services