- Initial version. Support for snapshot archives in main memory and on the file system.
- Allow different types of input documents (e.g., CSV files or Json).
- External merge-sort for large CSV files.
- Add managers for maintaining sets of archives.
- Proper handling of date/time objects by the default archive reader and writer.
- Optional arguments for Json encoder and decoder for persistent archives.
- Add encoder and decoder information to archive manager metadata.
- Simple command-line interface for persistent archive manager.
- Add archive manager that maintains descriptors in a relational database (#8).
- Add index position information to column class (#11).
- Add `__getitem__` and `get()` methods to `SnapshotListing`.
- Include wrapper for CSV files.
- Commit CSV files directly to a HISTORE archive.
- Fix bug when adding snapshot from file without primary key (#19).
- Add default Json encoder and decoder for `ArchiveFileStore`.
- Add optional operation descriptor to snapshots (#21).
- Add support for archive rollback.
- Disable type inference when checking out dataset snapshot as data frame (#24).
- Add more compact archive serialization option.
- Add option to select archive serializer (#27).
- Add option to commit dataset snapshot from a data stream.
- Add `histore.archive.reader.SnapshotReader` (a `histore.document.base.Document` implementation) to read dataset snapshots.
- Add close method to `histore.archive.reader.ArchiveReader` interface.
- Change behavior of `histore.document.schema.to_schema()` to take existing Column objects into account.
- Direct update of archive snapshots via `apply()` and `histore.document.operator.DatasetOperator`.
- Require archives to be created from initial snapshot if primary key is used.
- Add `histore.document.json.base.JsonDocument` to read serialized Json documents.
- Use user's cache directory as the default parent directory for archive managers.
- Remove option for partial merge.
- Rename type-hint `Schema` to `DocumentSchema`.
- Add empty document class `histore.document.mem.Schema`.
- Change format of serialized archive JSON files.
- Change internal representation of timestamps.
- Handle empty CSV files properly.