dataCleaning
português
This tool aims at helping curators in identifying possible errors and to standardize data. Records are not modified. The system just presents "suspect" records, recommending that they be checked by each author or curator. The tool is under constant development, so any suggestion is more then welcome.

Select a collection 
collection:    SinBiota
total number of records online 136386 
- without coordinates
- georeferenced 136383 
- access to georeferenced data denied
- in the sea 15632 
- blank catalognumber
  smaller: 671   larger: 23618 [ gap ]  
repeated records
catalog number 127790 
duplicate records 30501 
collector's name and number
last update  -  error logs
of the collection:  29-11-2022 of dataCleaning:  05-12-2022
geographic distribution of the specimens

collection profile
dataCleaning statistics
geographic coordinates analysis

taxonomic data
inventory scientific name - collector - identifier - types
kingdom 24209 suspect records
family not found
genus 63 suspect records
species 420 suspect records
subspecies not found
author not found
duplicate not found
other inconsistencies not found
annotations 40 annotations
locality data
inventory country - state - municipality
name of the country/state 1825 suspect records
outlier 302 suspect records
long/lat outside the world limit 393 suspect records
equal long/lat 99 suspect records
long or lat equal to zero not found
long/lat in the sea (Brazil) 880 suspect records
municipality name (Brazil) 14999 suspect records
coordinate unit analysis (Brazil) 240 suspect records
other inconsistencies not found

date collected
collected before 1930 1117 suspect records
identification year previous to date collected not found
suggestions for blank fields
long/lat (Brazil) 3 suggestions  
country/state name not found
municipality name (Brazil) not found

search
dataCleaning
email
Centro de Referência em Informação Ambiental, CRIA