Skip to main content
The Daily Dallas

All of Dallas, every day

News

Dallas City Records Show Thousands of Duplicate Images Clogging Public Digital Archives

A quiet data-cleanup effort across Dallas city departments is exposing just how much storage space — and taxpayer money — redundant digital files have quietly consumed for years.

Share

By Dallas News Desk · Published 4 July 2026, 2:00 PM

4 min read

Updated 5 h ago· 4 July 2026, 10:03 PM

How we reported this

This article was generated by AI from the linked public sources. The Daily Dallas is independently owned and covers Dallas news free from advertiser or sponsor influence. Read our editorial standards →

Dallas City Records Show Thousands of Duplicate Images Clogging Public Digital Archives
Photo: United States. Department of Agriculture / Public domain (Wikimedia Commons)

Dallas city IT administrators are working through a backlog of more than 340,000 duplicate image files spread across municipal servers, a figure drawn from an internal audit completed in late May 2026 by the Office of Budget and Management's technology division. The cleanup, which targets everything from duplicated permit photographs to copied infrastructure inspection scans, is part of a broader digital records modernization push the city launched under its FY2026 operational budget.

The timing matters. Dallas is midway through a $47 million enterprise resource planning overhaul that touches nearly every department filing system, from Dallas Water Utilities to the Development Services Department on South Lamar Street. Officials managing that transition have flagged redundant image data as one of the top friction points slowing the migration — every duplicate file has to be reviewed, matched, and resolved before records can move cleanly to the new system.

The Numbers Behind the Problem

The scale is considerable. According to figures from the city's IT Services division, duplicate images account for roughly 18 percent of total unstructured data stored on municipal servers as of the May audit. Across all departments, the city maintains approximately 1.9 petabytes of stored data. That means the duplicate image problem alone represents an estimated 342 terabytes of redundant storage — capacity that costs the city money to maintain whether the files are useful or not.

Cloud and on-premise storage contracts reviewed as part of the budget process show Dallas pays blended storage rates that work out to roughly $22 per terabyte per month for actively managed data tiers. By that measure, the duplicate image backlog has been costing the city an estimated $7,500 per month in unnecessary storage overhead — or close to $90,000 annually — before any deduplication work began.

The problem is most acute inside the Development Services Department, which processes building permits and inspection records for neighborhoods across the city, from Oak Cliff to Lake Highlands. Field inspectors upload site photographs directly from mobile devices, and without an automated deduplication layer in place, the same image sometimes appears two, three, or even four times in a single project file. The department processed more than 61,000 permit applications in calendar year 2025, each with associated photo documentation.

Dallas Water Utilities, headquartered on North Harwood Street, faces a similar issue with pipeline inspection imagery collected by crews working across the city's roughly 5,000 miles of water and wastewater mains. Infrastructure photographs taken during routine work orders are frequently uploaded multiple times when field crews lose and then regain connectivity, creating duplicate entries that complicate both record searches and long-term infrastructure trending analysis.

What Cleanup Actually Looks Like

The city contracted with a records management vendor in March 2026 to run automated hash-comparison tools across the affected server directories. That process identifies files with identical digital signatures regardless of filename or folder location. As of the May audit, the automated pass had flagged 341,800 confirmed duplicate images for human review. Staff inside the city's 311 Service Center office on South Ervay Street have been seconded part-time to work through the review queue, prioritizing active project files first.

Progress has been incremental. The review team cleared approximately 74,000 files in the first six weeks of active cleanup — a pace that, if maintained, puts full resolution sometime in the fourth quarter of 2026. The freed storage is expected to be reallocated within the new enterprise system rather than reducing the city's contracted storage ceiling, at least in the short term.

For Dallas residents interacting with city services, the practical implication is faster record searches and fewer errors when pulling permit histories for properties. Homeowners in neighborhoods like Lakewood or Preston Hollow who have had repeated dealings with Development Services over renovation work are most likely to notice improved response times when inspectors pull up project photo logs. The city's IT Services division has indicated it plans to implement an upload deduplication layer before the enterprise migration goes live — a step that would prevent the problem from rebuilding itself once the cleanup is complete.

You might also like

Editorial picks

How did this story land?

Spread the word

Share

Have your say

Loading comments…

Sources

About this article

Published by The Daily Dallas

Covering news in Dallas. This article was generated by AI from the linked sources and was not reviewed by a human editor before publishing. See our editorial standards.

Spread the word

Share

See something wrong? Suggest a correction.

Daily brief

Enjoyed this? Wake up to Dallas news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Dallas and accept our Privacy Policy. Unsubscribe anytime.