Hong Kong's digital records crisis has a name, and it is unglamorous: duplicate image replacement. Across government portals, land registry archives, and the sprawling commercial databases that underpin the city's financial services sector, redundant and outdated image files have accumulated at scale. The question facing technologists, regulators, and business operators alike is no longer whether to act, but how — and who pays.
The pressure is immediate. The Land Registry, headquartered in Queensway Government Offices in Admiralty, processes tens of thousands of property document scans each month. Sources familiar with the registry's internal workflows — speaking in a professional capacity without authorisation to comment publicly — describe a backlog of duplicate scans dating to the early digitisation drives of the late 2000s. Meanwhile, the Hong Kong Monetary Authority's oversight of fintech platforms operating out of the International Finance Centre and Exchange Square has drawn renewed scrutiny to how financial institutions store, tag, and purge replicated know-your-customer imagery.
Why This Moment Is Different
Two converging forces have pushed duplicate image replacement from a back-office nuisance to a boardroom concern. First, Hong Kong's data storage costs are not abstract. Commercial colocation space in Tseung Kwan O's data centre corridor — home to facilities operated by several major telecoms and cloud providers — runs at a premium relative to comparable hubs in Singapore and Tokyo. Every terabyte of redundant image data carries a direct dollar cost. Second, the Personal Data (Privacy) Ordinance, enforced by the Office of the Privacy Commissioner for Personal Data on Quarry Bay's King's Road, imposes obligations on data handlers to ensure accuracy and avoid unnecessary retention. Duplicate images of identity documents, faces, and signatures sit in uncomfortable legal territory when they persist beyond their original purpose.
The Office of the Government Chief Information Officer — which coordinates digitisation policy across bureaux — published its Digital Blueprint update in 2024 setting a framework for government data hygiene. That blueprint identified image deduplication as a priority action item but stopped short of mandating a citywide timeline. The gap between policy aspiration and operational execution is where most of the current difficulty lives.
Estimates from technology consultancies working with Hong Kong government clients — figures not independently verified by this newspaper — suggest that duplicate image files can account for between 20 and 35 percent of total unstructured data storage in large public-sector deployments. Even at the conservative end, the storage cost implications for an organisation running petabyte-scale archives are material.
The Decisions That Cannot Wait
Three choices will define how this plays out. The first is whether to pursue automated deduplication tools or commission manual audits. Automated tools are faster and cheaper but carry a non-trivial error rate when distinguishing genuinely distinct images from near-duplicates — a particular problem for land survey photographs and identity document scans where minor differences carry legal weight. The Hospital Authority, which manages records across 43 public hospitals including Queen Mary in Pok Fu Lam and Prince of Wales in Sha Tin, faces exactly this dilemma with its patient imaging archives.
The second decision concerns governance: which body sets the deduplication standard? The OGCIO, the Privacy Commissioner's office, and individual bureaux each have partial jurisdiction. Without a single accountable lead, the risk is that each department implements incompatible protocols, creating a new layer of inconsistency on top of the old one.
The third decision is timing. Organisations that defer until the next government IT procurement cycle — typically aligned with the Financial Secretary's budget cycle, with the next full-year budget due in February 2027 — will find themselves operating under growing compliance pressure in the interim. The Personal Data (Privacy) Ordinance carries enforcement powers, and the Privacy Commissioner's office has increased its audit activity since 2023.
Practical steps available now include commissioning a data inventory — a straightforward exercise that maps where image files live, how many copies exist, and when each was last accessed. For private firms operating in Central or Kowloon Bay's commercial districts, that exercise can often be completed within a quarter without waiting for regulatory direction. What the city cannot afford is another year of drift while the duplicate count climbs.