User Details
- User Since
- Jun 9 2015, 9:03 AM (547 w, 1 d)
- Availability
- Available
- IRC Nick
- dcausse
- LDAP User
- DCausse
- MediaWiki User
- DCausse (WMF) [ Global Accounts ]
Tue, Dec 2
forgot to mention that the reindex was started yesterday on the two other clusters (eqiad, cloudelastic)
Mon, Dec 1
A/B test results on other wikis: https://people.wikimedia.org/~dcausse/T404858-completion-default-sort-2.html
Going to start the reindex today
Thu, Nov 27
went with the approach of enabling on the five georgian wikis at once, please let me know if a more conservative approach (one wiki first) is preferable.
The update process has been fixed.
Existing stale data in the search index will get fixed when:
- a new revision of the page is created
- a template change propagates
- when the continuous cleanup mechanism processes a page with stale data
Wed, Nov 26
CirrusSearch has to be careful when specifying timeouts of a regex query.
Regex queries are particularly costly and may cause a lot of stress on the servers if not properly protected.
The 15s timeouts has been setup for this, to ensure that the search backend return before any other timeouts are applied otherwise this might mean that a costly query mist continue to run outside of the concurrency protection (T152895).
Unless you noticed that that the regex got a lot slower recently and more queries I think it is safer to keep the 15s internal timeout.
Mon, Nov 24
Thu, Nov 20
Thanks for reporting this, I think there are two different issues that allowed such suggestions to appear:
- defaultsort is indeed not properly removed from the search index when it's erased, a null value unfortunately tells the system to ignore it when updating it, this needs to be fixed for this field
- defaultsort values are allowed to help completion only if they match a particular pattern, this pattern seems too permissive and should be corrected to limit the possibility of such vandalism to impact search suggestions in the future
Wed, Nov 19
Should be ready once 1.46.0-wmf.3 is deployed, earliest would be Thursday nov 20 but probably safer to wait til the following week in case we rollback.
Tue, Nov 18
it is a bit cumbersome to run unfortunately and some adaptations have to be made (we only used it to backfill article countries). The script is in stat1009.eqiad.wmnet:~dcausse/articlecountry:
- backfill_articlecountry.scala the spark job that reads hdfs://analytics-hadoop/user/dcausse/topic_model/wiki-region-groundtruth/regions-cirrus-upload.tsv.gz and convert it to classification.prediction.articlecountry weighted tags, this one would have to be adapted based on your source data
- wiki.lst: the list of wikis to filter on
- backfill.sh the shell script that orchestrates all this
Mon, Nov 17
Indeed, the new debian package wmf-opensearch-search-plugins version 1.3.20+12 has to be installed to run the lastest cirrus version. We generally maintain the cirrussearch-opensearch-image docket image that is used by MW developers and our cirrus integration test suite, but here I think that you install opensearch on the existing quibble image and thus refreshing this image with the new version of the plugin is indeed what should be needed.
This should be fixed, I can see the partial search response instead of the error.
@TJones the change should be live on hewiki and ruwiki, could you draft a message for the tech news possibly by adding some text to https://meta.wikimedia.org/wiki/Tech/News/2025/48?
We received notifications from users that the search API which is configured to allow 50s timeouts to support costly search requests is now failing at 15s with an upstream request timeout (T410007). The user reported that the behavior started to change around nov 11th which is apparently when we started to roll out this new route on group2 wikis. I'm not 100% sure that this change is the cause of this new behavior but IIUC on all wikis except enwiki we now route api.php requests to the rest-gateway. If I'm not mistaken the rest-gateway has a default timeout of 15s which might explain this new behavior? Are there ways to vary this timeout based on the target action API?
Indeed, the internal timeout should be 50s to allow the regex to run. It is possible that something changed in the request flow that there is now a component failing earlier than the allowed 50s.
Fri, Nov 14
If pushing to kafka-main you might need to increase broker's message.max.bytes see T344688.
Thu, Nov 13
Fri, Nov 7
Thu, Nov 6
I think this is now fixed, the behavior of items and lexemes should be the same.
The API response looks like this now (on L7 when searching for L7):
{Wed, Nov 5
Tue, Nov 4
Nov 3 2025
yes the event you crafted is perfect it's just missing the meta section, also please use event-gate to push them (otherwise you would bypass the various event platform validations):
curl -H"User-Agent: achou-T401021/wmf" -H"Content-Type: application/json" -XPOST https://eventgate-main.discovery.wmnet:4492/v1/events -d '[events]'
Event:
{ "meta": { "stream": "mediawiki.cirrussearch.page_weighted_tags_change.v1", "domain": "test.wikipedia.org" }, "dt": "2025-11-03T10:56:00Z", "wiki_id": "testwiki", "page": { "page_id": 1, "page_title": "SomePage", "namespace_id": 0, "is_redirect": false }, "weighted_tags": { "set": { "recommendation.tone": [ { "tag": "exists", "score": 1.0 } ] } } }
Oct 31 2025
Oct 29 2025
If only touching wgEnableEventBus to add TYPE_EVENT is there a risk to start producing data to unrelated streams where downstream consumers might be a bit puzzled to see things from loginwiki? If yes perhaps constraining loginwiki to only produce to mediawiki.product_metrics.suggested_investigations_interaction should be preferred?
Oct 28 2025
@TJones could you try to replicate this step by step from a fresh repository (cleanly fetched from gerrit and no target folders).
By default I think maven is supposed to ignore these folders when moving them to the target folders. I believe we should find what is causing them to appear in the first place because I'm afraid that fixing duplicate-finder-maven-plugin might be too late because it'd mean that these .DS_Store folders already have leaked into the build directories which could cause them to be published silently.
If on the other hand you found third-party jars (dependencies we use) with such folder in them this is a different story, we might possibly try to raise the issue to the lib owner and use the workaround you suggested to make our build pass (but haven't hit this problem myself so hopefully it's not the case).
I believe the issue is that we generally prefer EntityIdSearchHelper when matching lexeme IDs, this search helper does not have any customization for lexemes. A simple approach is to prefer the Cirrus version of the hits which contains the expected set of metadata, this might not fully solve the issue in cases where the Lexeme ID is searched rapidly after being created (searched before the search engine is updated) but this is perhaps good enough for now?
Oct 27 2025
Sorry I did not paste this query which is:
SELECT (COUNT(*) AS ?C) WHERE {
<https://commons.wikimedia.org/wiki/Category:Current_De_Havilland_Canada_aircraft_of_Universal_Air> ?p <https://commons.wikimedia.org/wiki/Category:Universal_Air_current_fleet> .
}I tested this it and all nodes seem to have the missing link.
I would consider this ticket done.
We should wait for https://gerrit.wikimedia.org/r/c/search/extra/+/1198569 I think
Oct 24 2025
I think the problem is in the extra plugin, I could reproduce it when the weighted_tags is current null in opensearch with the following bulk sequence:
{"index": {"_index": "my_database_content", "_id": "10000"}} {} {"update": {"_index": "my_database_content", "_id": "10000"}} {"script":{"source":"super_detect_noop","lang":"super_detect_noop","params":{"handlers":{"weighted_tags":"multilist","version":"documentVersion"},"source":{"version":1,"weighted_tags":["mytag/__DELETE_GROUPING__","myothertag/somedata|2"]}}},"upsert":{"version":1}}
CirrusSearch may provide such sorting options via T40403. I believe that this ticket refers to the UI work required to happen when this capability is available from CirrusSearch, given that there is no such selector in Special:Search I have the impression that the intent of this ticket was to adapt Advanced-Search, @thiemowmde am I missing something?
Opened https://gitlab.wikimedia.org/repos/search-platform/opensearch-plugins-deb/-/merge_requests/12 that should have all this.
Oct 23 2025
Oct 21 2025
Looking at the lag data of the category graph it seems to me that around june 12 2015 we started to have more lag issues:
It's perhaps not enough data to draw any conclusions but perhaps something to look into if something has changed around this period.
Actually the graph seems outdated on all 4 endpoints I tested.
The query:
SELECT ?out WHERE { SERVICE mediawiki:categoryTree { bd:serviceParam mediawiki:start <https://commons.wikimedia.org/wiki/Category:Aircraft_of_Universal_Air> . bd:serviceParam mediawiki:direction "Reverse" . bd:serviceParam mediawiki:depth 5 . } } ORDER BY ASC(?depth) LIMIT 200
Should return more than two items:
for endpoint in wdqs-internal-main.svc.eqiad.wmnet wdqs-main.svc.eqiad.wmnet wdqs-internal-main.svc.codfw.wmnet wdqs-main.svc.codfw.wmnet; do echo -n "$endpoint: "; curl -s -XPOST --data-urlencode [email protected] https://$endpoint/bigdata/namespace/categories/sparql?format=json | jq '.results.bindings | length'; done
wdqs-internal-main.svc.eqiad.wmnet: 2 wdqs-main.svc.eqiad.wmnet: 2 wdqs-internal-main.svc.codfw.wmnet: 2 wdqs-main.svc.codfw.wmnet: 2
