
Statewide historical compilation
Geological Survey of NSW
We were pleased to be selected as a service provider for the statewide NSW ARRP project, which involved extracting and digitising historical data from legacy hard-copy reports. We built bespoke applications so that extractions were completed with high precision and cost-effectiveness. The data was then carefully validated to meet the department’s exacting standards and loaded directly into its GBIS database for public availability.
The ARRP program required digitisation of legacy geological reports and datasets spanning decades of exploration activity. The source material included scanned logs, PDFs, and mixed-format tables.
We applied OCR, structured extraction, and standardised metadata to deliver consistent datasets that could be loaded into GBIS, the Survey's statewide MS-SQL drilling and geochemistry database.
A series of thorough validation workflows ensured high-quality data, meeting the exacting standards the client required.
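As an illustration only, an automated validation pass of the kind described above might look like the sketch below. The `Collar` fields, the bounding-box check, and the specific rules are assumptions made for this example; they are not the project's actual schema or the GBIS loading rules.

```python
from dataclasses import dataclass


@dataclass
class Collar:
    # Hypothetical fields; the real collar schema is not described here.
    hole_id: str
    easting: float
    northing: float
    total_depth_m: float


def validate_collars(collars, bounds):
    """Return a list of (hole_id, message) issues; an empty list means all checks passed.

    bounds is (min_easting, max_easting, min_northing, max_northing).
    """
    min_e, max_e, min_n, max_n = bounds
    issues = []
    seen = set()
    for c in collars:
        # Each hole should appear only once in a batch.
        if c.hole_id in seen:
            issues.append((c.hole_id, "duplicate hole_id"))
        seen.add(c.hole_id)
        # Coordinates should fall inside the expected survey area.
        if not (min_e <= c.easting <= max_e and min_n <= c.northing <= max_n):
            issues.append((c.hole_id, "coordinates outside expected bounds"))
        # A drilled hole must have a positive depth.
        if c.total_depth_m <= 0:
            issues.append((c.hole_id, "non-positive total depth"))
    return issues
```

Records flagged by a pass like this would go to manual review rather than being loaded automatically.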
Client
Geological Survey of NSW
Program
ARRP legacy capture
Coverage
Statewide
Format
Reports and scans
Challenge
- Mixed-quality scans and inconsistent metadata
- High volume of historical reports
- Need for strict QA before publication
Approach
- OCR and parsing with structured templates
- Standardised metadata and controlled vocabularies
- Automated validation with manual QA review
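The first two steps of the approach can be sketched roughly as follows: a structured template (here a regular expression) parses one OCR'd assay line, and a controlled vocabulary maps free-text units to standard codes. The line layout, the `ASSAY_LINE` pattern, and the `UNIT_VOCAB` entries are hypothetical and chosen only for illustration; real report layouts vary widely.

```python
import re

# Hypothetical template for one OCR'd line: "<hole> <from> <to> <element> <value> <unit>"
ASSAY_LINE = re.compile(
    r"^(?P<hole_id>[A-Z]+\d+)\s+"
    r"(?P<from_m>\d+(?:\.\d+)?)\s+"
    r"(?P<to_m>\d+(?:\.\d+)?)\s+"
    r"(?P<element>[A-Za-z]{1,2})\s+"
    r"(?P<value>\d+(?:\.\d+)?)\s*"
    r"(?P<unit>\S+)$"
)

# Illustrative controlled vocabulary: unit spellings seen in reports -> standard code.
# g/t and ppm are numerically equivalent mass fractions.
UNIT_VOCAB = {"ppm": "ppm", "g/t": "ppm", "gpt": "ppm", "ppb": "ppb", "%": "pct", "pct": "pct"}


def parse_assay_line(line):
    """Parse one line into a record, or return None so it can go to manual QA."""
    m = ASSAY_LINE.match(line.strip())
    if m is None:
        return None  # line does not fit the template
    unit = UNIT_VOCAB.get(m["unit"].lower())
    if unit is None:
        return None  # unit is not in the controlled vocabulary
    return {
        "hole_id": m["hole_id"],
        "from_m": float(m["from_m"]),
        "to_m": float(m["to_m"]),
        "element": m["element"].capitalize(),
        "value": float(m["value"]),
        "unit": unit,
    }
```

Returning `None` instead of guessing keeps unparseable lines out of the automated path, which matches the manual QA review step in the approach.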
Outcome
- Searchable archive of legacy exploration data
- Improved discoverability for regional studies
- Reusable pipeline for ongoing data capture
Services Delivered
Tooling
Impact Metrics
Reports processed
3,125
Collars extracted
141,521
Surface samples extracted
396,825
Assays extracted
546,177