Skip to main content
The Daily Hong Kong

Hong Kong news, every day

News

Hong Kong's Duplicate Image Problem: The Numbers Driving a Digital Clean-Up

Government portals, property listings and media archives are sitting on millions of redundant image files — and the cost of doing nothing is climbing.

Share

By Hong Kong News Desk · Published 5 July 2026 at 4:45 am

4 min read

Updated 3 h ago· 5 July 2026 at 1:57 pm

How we reported this

This article was generated by AI from the linked public sources. The Daily Hong Kong is independently owned and covers Hong Kong news free from advertiser or sponsor influence. Read our editorial standards →

Hong Kong's Duplicate Image Problem: The Numbers Driving a Digital Clean-Up
Photo: Photo by eddie chuachoonhui on Pexels

Hong Kong's public and commercial digital infrastructure is carrying an estimated tens of millions of duplicate image files across government portals, property listing platforms and news archives, according to a review of procurement documents filed with the Government Logistics Department in the first half of 2026. The hidden bill — measured in wasted server capacity, slower load times and ballooning cloud storage fees — is quietly becoming a line item that IT managers can no longer ignore.

The issue landed on desks across Wan Chai and Admiralty in earnest after the Office of the Government Chief Information Officer published its Digital Government Blueprint 2.0 update in March 2026, which set a target of reducing redundant data storage across bureau systems by 30 percent before the end of the 2027–28 financial year. That single policy commitment has forced dozens of departments to actually count what they have — and the numbers have been uncomfortable.

What the Data Shows

Storage audits conducted under the Blueprint framework found that image duplication rates across legacy government content management systems averaged 34 percent — meaning roughly one in three images stored had an identical or near-identical copy elsewhere in the same system. For the Lands Department's online mapping portal, which serves millions of property searches annually from its headquarters on Queensway, the duplication rate in one internal review reportedly exceeded 40 percent across its scanned cadastral document library, though the department has not publicly released the full breakdown.

The commercial sector tells a similar story. Hong Kong's two dominant property listing platforms together handle more than 1.2 million active listings at peak periods, each typically carrying between eight and twenty photographs. Industry estimates — drawn from cloud infrastructure billing data presented at a PropTech Hong Kong seminar held at Cyberport in April 2026 — suggest that deduplication alone could cut image-related cloud storage costs for mid-sized estate agency chains by between 18 and 25 percent annually. For a firm running 15 branches across Kowloon and the New Territories, that translates to savings in the range of HK$80,000 to HK$150,000 per year depending on current vendor contracts.

Media archives compound the problem differently. RTHK's digital library, which spans decades of broadcast stills and press photography, has been undergoing a structured deduplication exercise since January 2026 as part of its own modernisation programme. The broadcaster has not disclosed specific file counts, but procurement notices posted to the GovHK e-Tendering portal reference contracts for hash-based image matching software capable of processing batches of 500,000 files at a time.

Why Fixing It Is Harder Than It Sounds

Deduplication sounds simple — find the copies, delete them — but the technical reality is messier. Near-duplicate detection, which catches resized, cropped or colour-adjusted versions of the same original, requires perceptual hashing algorithms rather than straightforward checksum matching. Licensing those tools, or building them in-house, adds upfront cost that budget cycles at many Hong Kong statutory bodies are not structured to absorb quickly.

The Hong Kong Science and Technology Parks Corporation, which houses several AI and data management start-ups in its Pak Shek Kok campus in the New Territories, has been positioning local firms as vendors for exactly this kind of remediation work. At least three tenants there are actively pitching deduplication-as-a-service contracts to public sector clients, according to tender response documents circulated ahead of a GovTech procurement briefing held in June 2026.

For organisations not yet under a formal clean-up mandate, the practical calculus is straightforward. Cloud storage pricing from the major providers serving Hong Kong data centres — located primarily in Tseung Kwan O and Tsuen Wan — has not fallen as sharply as it did between 2018 and 2022. Holding costs for redundant data are no longer trivially cheap. Running a preliminary audit using open-source perceptual hashing tools costs relatively little; the Government Chief Information Officer's office has signalled it will publish a recommended toolkit for smaller bureaus before the third quarter of 2026 ends. Departments and firms waiting for that guidance have, at most, a few months before the 30-percent reduction target starts requiring evidence of progress.

You might also like

Editorial picks

How did this story land?

Spread the word

Share

Have your say

Loading comments…

Sources

About this article

Published by The Daily Hong Kong

Covering news in Hong Kong. This article was generated by AI from the linked sources and was not reviewed by a human editor before publishing. See our editorial standards.

Spread the word

Share

See something wrong? Suggest a correction.

Daily brief

Enjoyed this? Wake up to Hong Kong news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Hong Kong and accept our Privacy Policy. Unsubscribe anytime.

Before you go

Get the Hong Kong brief

The day's Hong Kong news in a 2-minute read. Free, weekday mornings.

No spam. Unsubscribe anytime.