Skip to content

Error attempting to pull meteorite data from NASA #3

@simonw

Description

@simonw

https://data.nasa.gov/Space-Science/Meteorite-Landings/gh4g-9sfh

/tmp $ socrata2sql insert data.nasa.gov gh4g-9sfh
WARNING:root:Requests made without an app_token will be subject to strict throttling limits.

Connecting to database
  ▶ Using default SQLite database "sqlite:///meteorite_landings.sqlite".
  ▶ Query "SELECT PostGIS_version();" failed. Geometry columns will be skipped.

Setting up new table, "meteorite_landings", from Socrata API fields
  ▶ "geolocation" is a location column but your database doesn't support PostGIS so it'll be skipped.
  ▶ Loading from API ◉◉◉◉◉◉◉◉◉◉◯◯◯◯◯◯◯◯◯◯◯◯◯◯◯◯◯◯◯◯◯◯ 32%Traceback (most recent call last):
  File "/usr/local/lib/python3.7/site-packages/socrata2sql/parsers.py", line 11, in parse_datetime
    return datetime.strptime(str_val, "%Y-%m-%dT%H:%M:%S.%f")
  File "/usr/local/Cellar/python/3.7.2_1/Frameworks/Python.framework/Versions/3.7/lib/python3.7/_strptime.py", line 577, in _strptime_datetime
    tt, fraction, gmtoff_fraction = _strptime(data_string, format)
  File "/usr/local/Cellar/python/3.7.2_1/Frameworks/Python.framework/Versions/3.7/lib/python3.7/_strptime.py", line 359, in _strptime
    (data_string, format))
ValueError: time data '-0300-01-01T00:00:00' does not match format '%Y-%m-%dT%H:%M:%S.%f'

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/usr/local/bin/socrata2sql", line 11, in <module>
    sys.exit(main())
  File "/usr/local/lib/python3.7/site-packages/socrata2sql/cli.py", line 299, in main
    to_insert.append(Binding(**parse_row(row, Binding)))
  File "/usr/local/lib/python3.7/site-packages/socrata2sql/cli.py", line 258, in parse_row
    parsed[col_name] = parsers[mapper_col_type](col_val)
  File "/usr/local/lib/python3.7/site-packages/socrata2sql/parsers.py", line 14, in parse_datetime
    return datetime.strptime(str_val, "%Y-%m-%dT%H:%M:%S")
  File "/usr/local/Cellar/python/3.7.2_1/Frameworks/Python.framework/Versions/3.7/lib/python3.7/_strptime.py", line 577, in _strptime_datetime
    tt, fraction, gmtoff_fraction = _strptime(data_string, format)
  File "/usr/local/Cellar/python/3.7.2_1/Frameworks/Python.framework/Versions/3.7/lib/python3.7/_strptime.py", line 359, in _strptime
    (data_string, format))
ValueError: time data '-0300-01-01T00:00:00' does not match format '%Y-%m-%dT%H:%M:%S'

It looks like this is due to invalid data, but it would be nice if socrata2sql could either show a nicer error message or had an option to report and skip invalid rows.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions