Hong Kong's government databases, hospital record systems and land registry archives are sitting on years of accumulated duplicate imagery — redundant scans, re-uploaded photographs and mirrored files that are quietly inflating storage costs, slowing retrieval times and, in some cases, creating genuine administrative confusion. The question now is not whether the problem exists. It is who fixes it, how fast, and at whose expense.
The issue has gained urgency because the city's major digital modernisation push, anchored in the Digital Policy Office's blueprint released in late 2024, set hard targets for consolidating government data infrastructure by mid-2026. That deadline has now passed. Several departments, including the Lands Registry on Queensway and the Hospital Authority's central records division, are understood to be operating data environments where duplicate imagery accounts for a measurable share of stored files — though no official consolidated figure has been published to date.
Why the Timing Matters
The Greater Bay Area integration agenda has added pressure. Hong Kong institutions sharing data pipelines with Mainland counterparts in Shenzhen and Guangzhou need clean, non-redundant image records to meet cross-boundary data standards introduced under the GBA Data Flow framework. Duplicate files are not merely a storage nuisance; they become a compliance liability when records flow across the boundary under the Personal Information Protection Law provisions that apply to Mainland-linked systems.
The Hospital Authority alone manages imaging records for 44 public hospitals and institutions across Hong Kong. Medical imaging — CT scans, X-rays, MRI outputs — represents some of the heaviest data loads in any public-sector archive. The HA's digital records modernisation program, which has been running in phases since 2022, has repeatedly identified legacy duplication as a core technical debt. Each duplicated file that survives a migration cycle carries forward storage and indexing costs that compound over time.
Commercial landlords and estate agents operating out of offices along Des Voeux Road Central have separately flagged a parallel problem in property listing platforms, where the same unit photographs appear under multiple listing IDs — a known irritant to buyers and a minor but real source of legal ambiguity in cases where image ownership is disputed.
The Decisions Ahead
Three choices now define the path forward. First, the Digital Policy Office must decide whether to mandate a single deduplication standard across all bureaux or allow departments to procure their own solutions — a decentralised approach that has historically produced patchy results in the city's IT procurement history. The Innovation and Technology Bureau has previously favoured department-level flexibility, but the GBA compliance argument is shifting that calculus.
Second, the Hospital Authority faces a specific fork: run deduplication against live archives, which risks brief service disruptions at facilities including Queen Mary Hospital in Pok Fu Lam and Prince of Wales Hospital in Sha Tin, or stage the clean-up across a rolling multi-year window that defers the compliance benefit. A phased approach covering the busiest imaging departments first would likely take 18 to 24 months on current project timelines.
Third, private-sector platforms — particularly the property portals and legal document services concentrated in the Central and Sheung Wan districts — will need to decide whether to adopt the government's eventual standard voluntarily or wait for any legislative signal. The Communications Authority has not signalled imminent regulatory action on commercial image databases, but the Article 23 data-security framework has already made companies more attentive to what sits in their servers.
The practical upshot for organisations watching this space: institutions with cross-boundary data obligations should begin auditing their image repositories now, before any formal standard is gazetted. The cost of a proactive audit is almost certainly lower than the cost of a reactive overhaul imposed on a compressed government timeline. For public-sector bodies, the Digital Policy Office is expected to publish updated guidance in the third quarter of 2026 — and that document will set the terms of debate for everything that follows.