📊 Full opportunity report: When a Content Network Starts Publishing to Itself on ThorstenMeyerAI.com — validation score, market gap, and execution plan.

TL;DR

A content network with 474 WordPress sites is publishing content predominantly to a small subset of sites, leaving over half inactive. The issue stems from internal system biases and supply-demand mismatches, raising concerns about network health and content diversity.

A large automated content network is publishing predominantly to a small subset of its sites, leaving over half of the sites without new content for weeks. This internal publishing loop, confirmed through recent audits, raises questions about the system’s health and content diversity, affecting SEO and user engagement.

The network consists of 474 WordPress sites managed by two interconnected systems: Stenvrik, which sources and evaluates news signals, and DojoClaw, which rewrites and distributes content. Recently, an audit revealed that 80% of posts are concentrated on just 8% of the sites, mainly in the technology and AI categories. Meanwhile, over half of the sites received no new content in a 28-day window, indicating a self-publishing loop where the system favors certain sites and neglects others. The issue was traced to two causes: within-topic concentration, where the content matching algorithm kept favoring the same popular sites, and a supply mismatch, where the majority of content was tech-focused, but most sites covered other topics like health or food. The fix involved adjusting the distribution algorithm to prioritize less active sites and diversify content placement, using caps and recency-based selection to ensure broader distribution and prevent overloading favored sites.

Balancing a 474-site network — ThorstenMeyerAI.com
ThorstenMeyerAI.com
AI & Tooling · Engineering Note
Systems at scale

When a content network starts publishing to itself

A 474-site network quietly collapsed onto 38 of its own favorites while half the catalog went dark. The throughput graph looked fine. The fix wasn’t one thing — it was two causes and a three-part repair across two decoupled systems.

Stenvrik

News-intelligence layer

Ingests hundreds of feeds, scores & geo-tags stories, surfaces what’s trending.

SUPPLY · what’s worth covering
DojoClaw

AI content engine

Rewrites a story in each site’s voice and fans it out across the catalog.

PLACEMENT · where it lands & how it reads
01The symptom

80% of output on 8% of sites

A 28-day audit, bucketed per site, was lopsided in a way the totals had hidden. Every individual placement was “correct” — the aggregate was a slow-motion failure.

Where 28 days of syndication actually landed

474-site catalog · per-site audit
Top 38 sites8% of catalog
80% of all posts
Top 4 sitesall tech titles
200+ articles/week each
249 sites53% of catalog
ZERO posts — half the network dark
02The diagnosis · refuse the obvious
Build a WordPress Website From Scratch 2026: Step-by-step: New WordPress 6.9 and Gutenberg: WordPress 7: What is new?

Build a WordPress Website From Scratch 2026: Step-by-step: New WordPress 6.9 and Gutenberg: WordPress 7: What is new?

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Not one bug — two independent causes

The tempting move is to blame the matcher and move on. The data showed two distinct problems living on two different systems, each needing its own fix.

Cause 1 · DojoClaw

Within-topic concentration

The matcher kept surfacing the same broad tech sites for every tech story, and rotation only shuffled candidates within the matched pool. A site that never entered the pool could never get a turn — fair only among the already-chosen.

Cause 2 · Stenvrik

Supply ≠ demand

53% of supplied content was tech/AI — but only ~13% of sites are. The catalog skews the other way, so those sites starved for on-topic material.

supply
tech/AI content in53%
demand
tech/AI sites in catalog~13%
03The load balancer · flip it
SEO Competitor Audit Journal: Perfect SEO tool and journal to audit, track and log your competitor’s SEO strategy

SEO Competitor Audit Journal: Perfect SEO tool and journal to audit, track and log your competitor’s SEO strategy

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Watch the network rebalance

Each square is one of the 474 sites; color is how much it’s publishing. Toggle the selection logic to see placement spread off the red-hot favorites and into the dark long tail.

Placement simulator

Same matcher relevance gate either way — the only change is how candidates are ordered after it.

38
sites carrying 80% of posts
249
dark sites · zero posts
overloaded
hottest sites at ~30/day
dark · 0 light healthy busy overloaded
04The three-part fix
Architecting AI Software Systems: Crafting robust and scalable AI systems for modern software development

Architecting AI Software Systems: Crafting robust and scalable AI systems for modern software development

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Placement, supply, throughput

Two causes meant the fix had to touch both systems — and only then could the ceiling rise without re-concentrating the load.

1

Placement levers

DojoClaw
  • Per-site weekly cap — any site over 25 posts/7d drops from the pool, pushing selection into the long tail (relaxes only if it would starve a fan-out).
  • Global LRU — order by network-wide recency, not just within-topic, so sites idle across the whole network float to the top.
  • Starvation floor — guaranteed by construction: the most-idle eligible site is always within the picks.
2

Supply rebalance

Stenvrik
  • Audited existing feeds for liveness — removed ones returning HTTP 200 but zero items (broken RSS).
  • Added a verified batch across Home, Garden, Health, Food, Fashion, Auto, Science, Pets & more — every feed fetched live first, weighted to the most idle categories.
  • Flagged throttled feeds (big publishers exposing only 1–2 items) for replacement rather than burying the risk.
3

Throughput raise

Scheduler
  • Fan-out width maxSites 5 → 7 — the extra slots land on fresh sites because the cap is now enforcing.
  • Quota depth K 2 → 3 — every category’s daily cap scaled ×1.5.
  • Honest note: a documented ~950/day intent the code never delivered (units quirk) stays gated behind a sign-off.
05What it adds up to
Content Strategy Toolkit, The: Methods, Guidelines, and Templates for Getting Content Right (Voices That Matter)

Content Strategy Toolkit, The: Methods, Guidelines, and Templates for Getting Content Right (Voices That Matter)

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

The scoreboard — with an honest asterisk

The change is behavioral: it shapes future placement, it doesn’t retroactively rescue the month sites sat dark. The proof is in the next weeks of data — which is why the instrumentation is the real deliverable.

Metric
Before
After
Concentration
80% on 38 sites
cap + LRU + floor
Dormant sites
249 (53%)
shrinking ↓
Feed sources
245
271 verified
Daily ceiling
~188/day
~280/day · +49%
Fan-out width
5
7
Why two systems, not one

Supply and placement are genuinely separate concerns. Diagnosing the imbalance meant looking at both sides and seeing they disagreed. A clean boundary made a failure that spanned both legible — good system boundaries organize thought, not just code.

The tradeoff taken

Ordering by load & idleness sacrifices a little topical ranking for dramatically better coverage. All candidates already cleared the relevance gate — so it’s a deliberate trade, not a regression.

ThorstenMeyerAI.com
Stenvrik (news-intelligence) ↔ DojoClaw (content engine) · figures reflect the May 2026 engineering audit & the behavioral changes made in response · the network’s response is being tracked.

Implications for Network Content Diversity and SEO

This pattern of self-publishing to a limited set of sites can harm the overall health of the content network by reducing diversity, increasing spam-like signals, and potentially damaging search engine rankings. It highlights the risks of automated systems developing unintended biases, which can lead to atrophy of less active sites and skewed content availability, ultimately undermining the network’s value for users and publishers alike.

Origins of Automated Content Distribution Challenges

The system was designed with a separation of concerns: Stenvrik handles news signal curation, while DojoClaw manages content rewriting and distribution. Prior to this issue, the network operated under the assumption that the algorithms would evenly distribute content. However, recent audits revealed a pattern where the system’s internal logic favored certain sites, particularly in tech categories, leading to an imbalance. Similar issues have been observed in other large-scale automated publishing systems, where the lack of explicit controls on distribution can cause over-concentration on popular nodes, with less attention paid to the overall network health.

"The system was working perfectly at each decision point, but the aggregate behavior was skewed, leading to a self-reinforcing publishing loop that we hadn't anticipated."

— Thorsten Meyer, system operator

Extent and Long-term Impact of Self-Publishing Loop

It remains unclear how widespread the long-term effects are on search rankings, user engagement, and whether similar patterns exist in other networks. The full impact of the imbalance will only be understood after ongoing monitoring and further audits.

Monitoring and Further Algorithm Refinements Expected

The system administrators plan to continue monitoring the distribution patterns closely, with upcoming updates to the content placement algorithms aimed at preventing recurrence. Additional controls may be introduced to ensure more equitable distribution across all sites, and further audits are scheduled to assess the effectiveness of these measures over the coming months.

Key Questions

Why is publishing to itself a problem for the content network?

It leads to over-concentration of content on a few sites, reducing diversity, risking SEO penalties, and causing many sites to become inactive, which diminishes the network’s overall value.

How did the system develop this self-publishing loop?

The algorithms favored certain popular sites within specific topics, and supply-demand mismatches meant most content was focused on tech, leaving other categories starved, which caused the system to repeatedly publish to the same sites.

What measures are being taken to fix this issue?

Adjustments include caps on site publication frequency, recency-based selection to prioritize inactive sites, and diversification of content placement to prevent overloading favored sites.

Could this pattern happen in other automated systems?

Yes, especially in systems with decoupled modules and insufficient controls on distribution logic. Ongoing monitoring and algorithmic safeguards are essential to prevent such biases.

What are the potential long-term effects if the issue is not addressed?

Persistent imbalance could harm search engine rankings, reduce user engagement, and cause the network to lose its effectiveness and credibility over time.

Source: ThorstenMeyerAI.com

This content is for general information only and is not financial, tax or legal advice. Consult a qualified professional for decisions about your money.
You May Also Like

Build vs Buy a Prebuilt AI Workstation

Exploring whether to build or buy a prebuilt AI workstation in 2026, considering recent market shifts, thermal management, and cost factors.

Purchase order exception tracker for small manufacturers

A new purchase order exception tracker for small manufacturers is set to be tested, aiming to improve supplier order management amid supply volatility.

One Video In, a Whole Publishing Kit Out — Without the Cloud

Discover how local-first video processing transforms content creation. Generate all your assets from a single source without relying on cloud services.

The Compute Reckoning: Anthropic Finally Admits What Customers Suspected for Ten Months

Anthropic confirms that compute shortages led to recent customer rate limits, marking a shift from previous claims of strategic product decisions.