There's a common understanding that new models come out starting in fall or later, so a (new) vehicle bought at the end of 2014 like the first row in the dataset will actually be a 2015 model. So the dataset is good.
In case folks are interested, we posted an issue on Github on the LanceDB side and got a nice conversation going that helps explain what is happening. If you're eager to index these kinds of datasets there's a bunch of remedies listed here: github.com/lancedb/lancedb/issues/1222
There's a common understanding that new models come out starting in fall or later, so a (new) vehicle bought at the end of 2014 like the first row in the dataset will actually be a 2015 model. So the dataset is good.
Ah! That makes sense.
In case folks are interested, we posted an issue on Github on the LanceDB side and got a nice conversation going that helps explain what is happening. If you're eager to index these kinds of datasets there's a bunch of remedies listed here:
github.com/lancedb/lancedb/issues/1222