summaryrefslogtreecommitdiffstats
path: root/CHANGELOG
diff options
context:
space:
mode:
Diffstat (limited to 'CHANGELOG')
-rw-r--r--CHANGELOG702
1 files changed, 702 insertions, 0 deletions
diff --git a/CHANGELOG b/CHANGELOG
new file mode 100644
index 0000000..a0fe07a
--- /dev/null
+++ b/CHANGELOG
@@ -0,0 +1,702 @@
+orcus 0.19.2
+
+* general
+
+ * fixed a build issue with gcc 14 due to a missing include for std::find_if
+ and std::for_each.
+
+ * fixed a segmentation fault with the orcus-test-xml-mapped test which
+ manifested on hppa hardware, as originally reported on
+ https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=1054376.
+
+* xls-xml
+
+ * fixed a crash when loading a document that includes a style record
+ referencing an unnamed style record as its parent. In Excel-generated
+ documents, styles only reference named styles as their parents. But in
+ 3rd-party generated documents, styles referencing unnamed styles as their
+ parents can occur.
+
+* gnumeric
+
+ * fixed a crash when the document model returned a null pointer when a
+ reference resolver interface was requested.
+
+orcus 0.19.1
+
+* general
+
+ * implemented orcus::create_filter() which instantiates a filter object of
+ specified type. The returned object is of type
+ orcus::iface::import_filter.
+
+ * moved test cases for format detection to the respective filter test files.
+
+* gnumeric
+
+ * fixed a bug where the import filter did not set the formula grammer prior
+ to importing.
+
+orcus 0.19.0
+
+* general
+
+ * added support for allowing use of std::filesystem,
+ std::experimental::filesystem or boost::filesystem per build
+ configuration.
+
+* xlsx
+
+ * refactored styles import to use style indices returned by the document
+ model implementer rather than using the indices stored in the file. This
+ allows the implementer to aggregate some style records and re-use the same
+ index for records that are stored as different records in the original
+ file.
+
+* xls-xml
+
+ * fixed a bug where column styles were not applied to the correct columns
+ when the starting column index was not 0.
+
+* gnumeric
+
+ * overhauled the Gnumeric import filter to fix many bugs and support many
+ missing features relative to the other filters included in orcus. Most
+ notable mentions are:
+
+ * cell styles
+
+ * rich-text strings
+
+ * named ranges
+
+ * row heights and column widths
+
+ * merged cells
+
+* parquet
+
+ * added partial support for Apache Parquet import filter. This is still
+ heavily experimental.
+
+orcus 0.18.1
+
+* sax parser
+
+ * added support for optionally skipping multiple BOM's in the beginning of
+ XML stream. This affects all XML-based file format filters such as
+ xls-xml (aka Excel 2003 XML).
+
+* xml-map
+
+ * fixed a bug where XML documents consisting of simple single-column records
+ were not properly converted to sheet data.
+
+* xls-xml
+
+ * fixed a bug where the filter would always pass border color even when it
+ was not set.
+
+* buildsystem
+
+ * added new configure switches --without-benchmark and --without-doc-example
+ to optinally skip building of these two directories.
+
+orcus 0.18.0
+
+* general
+
+ * fixed the flat output mode to properly calculate the lengths of UTF-8
+ encoded strings.
+
+ * replaced all uses of std::strtol() to parse_integer() to properly parse
+ strings that are not necessarily null-terminated.
+
+ * added a new output format type 'debug-state' which dumps the internal
+ state of the populated document model in detail. This can be useful
+ during debugging.
+
+ * separated the import_shared_string interface implementation from the
+ backend shared strings store per separation of responsibility.
+
+ * merged the foo_t and foo_active_t struct pair, such as font_t and
+ font_active_t, in the styles store into a single type using std::optional.
+
+ * revised the documentation and public API and cleaned things up where
+ necessary.
+
+* ods
+
+ * reimplemented the number format styles import to correctly keep track of
+ element stacks and correctly perform structure checks to detect malformed
+ documents.
+
+ * added new interface to import named styles applied to columns.
+
+ * added new interface to import attributes for asian and complex scripts for
+ the folloiwng font attributes:
+
+ * font name
+
+ * font size
+
+ * font style
+
+ * font weight
+
+ * re-designed the styles import interface to make it multi-level.
+
+ * re-worked the import of the style:text-underline-width attribute to make
+ its handling more in line with the specifications.
+
+* xls-xml
+
+ * added support for importing wrap-text and shrink-to-fit cell format
+ attributes.
+
+ * added support for importing cell-hidden and locked attributes.
+
+ * added support for importing direct and named cell formats applied to
+ columns and rows.
+
+* xlsx
+
+ * added support for importing wrap-text and shrink-to-fit cell format
+ attributes.
+
+ * added support for importing direct and named cell formats applied to
+ columns and rows.
+
+* xml-map
+
+ * added a new interface to pass the encoding information to the document
+ model so that it can correctly decode non-UTF-8-encoded string values.
+
+orcus 0.17.2
+
+* ods
+
+ * fixed a bug where the state of style:cell-protect="none" was not
+ explicitly pushed, thereby having had the same effect as not having this
+ attribute. After the fix, style:cell-protect="none" will explicitly push
+ the hidden state to false, locked state to false, and the formula-hidden
+ state to false.
+
+orcus 0.17.1
+
+* general
+
+ * addressed a number of coverity issues.
+
+ * removed a variety of compiler warnings.
+
+* ods
+
+ * re-generated sax parser tokens from ODF v1.3.
+
+ * revised the style import code to only push style attributes that are
+ actually specified in the XML.
+
+* xls-xml
+
+ * revised the XML structure validation strategy to ignore any mis-placed
+ elements and their sub structures rather than aborting the import.
+
+orcus 0.17.0
+
+* general
+
+ * set the baseline C++ version to 17.
+
+ * cleaned up the public API to replace pstring with std::string_view, union
+ with std::variant, and boost::optional with std::optional. With this
+ change, the public API no longer has dependency on boost.
+
+* spreadsheet document
+
+ * switched to using ixion::model_iterator for horizontal iteration of cells
+ instead of using mdds::mtv::collection.
+
+ * fixed a bug where exporting a spreadsheet document containing adjacent
+ merged cells regions to html incorrectly exported the merged cell areas.
+
+* xlsx
+
+ * cached cell values are now correctly loaded from the file.
+
+* sax parser
+
+ * utf-8 names are now allowed as element and attribute names.
+
+* css parser
+
+ * unquoted utf-8 property values are now allowed.
+
+* orcus-json
+
+ * fixed segmentation fault when using --mode structure with the Windows
+ build.
+
+ * added yaml output option.
+
+* xml-map
+
+ * fixed a bug where mapping of an XML document with namespace aliases
+ sometimes corrupts the alias values.
+
+* python
+
+ * added orcus.FormulaTokenOp enum type which describes type formula token
+ operator types in a more finer grained manner.
+
+* documentation
+
+ * added notes to how to use orcus-xml and orcus-json to map XML and JSON
+ documents to spreadsheet documents.
+
+orcus 0.16.1
+
+* fixed a build issue on 32-bit linux platforms, which was indirectly caused
+ by ixion.
+
+* fixed json parsing bug caused by an uninitialized variable, which manifested
+ itself on debian 32-bit platform.
+
+* removed compiler warnings on unused variables from the base parser handlers.
+
+orcus 0.16.0
+
+* general
+
+ * full formula recalculations are now optional when loading documents. It
+ makes more effective use of cached formula results.
+
+ * added the option of failing on the first faulty cell, or skipping them.
+
+ * fixed a bug that caused the threaded_sax_token_parser to deadlock.
+
+ * added base parser handler classes in the public headers so that they can
+ be sub-classed to overwrite necessary handler methods.
+
+* json-parser
+
+ * parsing of numeric values are now more strict for better conformance to
+ the specs.
+
+* ods
+
+ * added support for loading named expressions from ods documents.
+
+ * fixed an infinite loop when loading one of the attached ods documents from
+ https://bugs.documentfoundation.org/show_bug.cgi?id=82414
+
+* xlsx
+
+ * fixed a segfault when loading the xlsx document from
+ https://bugs.documentfoundation.org/show_bug.cgi?id=83711.
+
+* xls-xml
+
+ * fixed a bug that prevented formulas from referencing cells located in
+ later sheets.
+
+* xml-map
+
+ * adjusted the xml path expressions to be more like XPath. Previously, an
+ attribute was expressed as '@' in the old expression, but XPath uses '/@'.
+ The new expression uses '/@' for an attribute.
+
+ * added the ability to identify and import ranges from XML documents without
+ map file.
+
+ * added the ability to generate map file from XML documents for user
+ customization.
+
+ * added support to specify default namespace in the map file.
+
+* python
+
+ * added orcus.Cell class to represent individual cell values and attributes.
+
+ * fixed several memory leaks in the python binding layer.
+
+ * modified orcus.csv.read() function to take string input, instead of bytes.
+
+ * added __version__ attribute to the orcus module.
+
+ * cleaned up orcus.detect_format function to only take the stream parameter.
+
+ * added named_expressions properties to Document and Sheet class objects.
+
+ * added Python API to bulk-process a number of spreadsheet documents
+ (orcus.tools.file_processor).
+
+ * added Python API to download attachments from bugzilla services via REST
+ API (orcus.tools.bugzilla).
+
+orcus 0.15.4
+
+* fixed a build error with gcc 10 with LTO. For more details, visit
+ (https://bugs.gentoo.org/715154).
+
+* removed potentially non-free specification and schema files from the
+ package.
+
+orcus 0.15.3
+
+* xml-map
+
+ * fixed another bug related to filling of cells down the column in a linked
+ range with nested repeat elements. The bug would occur when the field in
+ a linked range is more than one level deeper than the nearest row group
+ element.
+
+* xls-xml
+
+ * fixed a bug where TopCell and LeftCell attributes of the Table element
+ were not properly honored.
+
+orcus 0.15.2
+
+* xml-map
+
+ * fixed a bug that prevented filling of cells down the column in a linked
+ range with nested repeat elements. The bug would occur when the field in
+ a linked range is associated with an element content rather than an
+ attribute.
+
+* xls-xml
+
+ * added code to properly pick up and pass the number format codes, including
+ named number format values such as 'General Date', 'Long Time, 'Currency'
+ etc.
+
+* fixed a build issue on older macOS environment, related to passing an rvalue
+ to a tuple expecting a const reference. The root cause was a bug in libc++
+ of LLVM < 7.
+
+* fixed a build issue with gcc5.
+
+orcus 0.15.1
+
+* switched xml_map_tree to using boost::object_pool to manage the life
+ cycles of the objects within xml_map_tree, to avoid memory
+ fragmentation.
+
+* fixed incorrect handling of newly created elements in xml_map_tree.
+
+* fixed segfault caused by double deletion of allocated memory for
+ xml_map_tree::element, which seemed to happen only on 32-bit gcc builds.
+
+* fixed weird test failures related to equality check of two double-precision
+ values, caused probably by aggressive compiler optimization which only seems
+ to get triggered in 32-bit gcc builds.
+
+orcus 0.15.0
+
+* spreadsheet interface
+
+ * import_sheet::fill_down_cells() has been added as a required method, to
+ allow the import filter code to duplicate cell value downward in one step.
+
+* json parser
+
+ * added test cases from JSONTestSuite.
+
+ * fixed a bug on parsing an empty array containing one or more blank
+ characters between the brackets.
+
+* sax parser
+
+ * fixed a bug on parsing an attribute value with encoded character
+ immediately followed by a ';', such as '&amp;;'.
+
+ * fixed a bug on parsing an assignment character '=' that either preceded or
+ followed by whitespaces in attribute definition.
+
+ * optionally use SSE4.2 intrinsics to speed up element name parsing.
+
+* orcus-xml
+
+ * revised its cli interface to make use of boost's program_options.
+
+ * orcus-xml-dump's functionality has been combined into orcus-xml.
+
+ * map mode now supports nested repeat elements to be mapped as range fields.
+
+* orcus-json
+
+ * map mode has been added to allow mapping of JSON documents to spreadsheet
+ document model. This mode either takes explicit mapping rule via map
+ file, or performs automatic mapping by auto-identifying mappable ranges by
+ analyzing the structure of the JSON document.
+
+ * structure mode has been added to display the logical structures of JSON
+ documents.
+
+ * significantly improved performance of json document tree by utilizing
+ object pool to manage the life cycles of json value instances.
+
+* xls-xml
+
+ * added support for importing named color values in the ss:Color attributes.
+
+ * added support for handling UTF-16 streams that contains byte order marks.
+
+* spreadsheet document
+
+ * significantly improved performance of flat format output generation.
+
+* internal
+
+ * string_pool now uses boost's object_pool to manage the instances of stored
+ strings.
+
+ * file_content class has been added to memory-map file contents instead of
+ loading them in-memory.
+
+ * memory_content class has been added to map in-memory buffer with the
+ optional ability to perform unicode conversion.
+
+ * dom_tree has been renamed to dom::document_tree, and its interface has
+ been cleaned up to hide its implementation details.
+
+orcus 0.14.1
+
+* addressed a number of coverity issues.
+
+* improved precision of points-to-twips measurement conversions by
+ reducing the number of numeric operations to be performed. This
+ especially helps on i386 platforms.
+
+orcus 0.14.0
+
+* spreadsheet interface
+
+ * import_data_table::set_range() now receives a parameter of type
+ range_t.
+
+ * import_sheet::set_array_formula() interface methods have been
+ removed and replaced with import_sheet::get_array_formula() that
+ returns an interface of type import_array_formula.
+
+ * import_formula interface class has been added to replace the
+ formula related methods of import_sheet. As a result,
+ set_formula(), set_shared_formula(), and set_formula_result()
+ methods have been removed from the import_sheet interface class.
+
+ * import_auto_filter::set_range() now receives a parameter of type
+ range_t, rather than a string value representing a range.
+
+ * import_sheet::set_fill_pattern_type() interface method now takes
+ an enum value of type fill_pattern_t, rather than a string value.
+
+* xls-xml
+
+ * pick up the character set from the XML declaration, and pass it
+ to the client app via import_global_settings interface.
+
+ * support importing of array formulas.
+
+* xlsx
+
+ * support importing of array formulas.
+
+ * fixed a bug where sheet indices being passed to the append_sheet()
+ interface method were incorrect.
+
+* shared formula handling code has been re-worked.
+
+* spreadsheet::sheet class has been de-coupled from the import and
+ export interfaces.
+
+* previously known as import_styles class is now split into styles
+ class and import_styles factory wrapper class.
+
+* sax_parser now gracefully ignores leading whitespace(s) if any,
+ rather than aborting the parsing for it's not a valid XML stream
+ to have leading whitespace(s). In the future we should make this
+ behavior configurable.
+
+* python
+
+ * add orcus.xlsx.read() function that takes a file object to load
+ an xlsx file as a replacement for orcus.xlsx.read_file().
+
+ * add orcus.ods.read(), orcus.xls_xml.read(), orcus.csv.read(),
+ and orcus.gnumeric.read() functions.
+
+ * add orcus.Sheet.write() method which exports sheet content to
+ specified format. For now only the csv format type is
+ supported.
+
+* xml_map_tree no longer requires the source stream persisted in
+ memory between the read and write.
+
+* the sax parser now stores the offset positions of each element
+ rather than their memory positions, in order to make the position
+ values usable between duplicated stream instances.
+
+* xml_structure_tree to support selection of an element by element
+ path.
+
+* document
+
+ * correctly set the argument separator depending on the formula
+ grammar type. This change fixes loading of ods documents with
+ formula cells.
+
+* fixed a build issue with boost 1.67.
+
+orcus 0.13.4
+
+* xls-xml
+
+ * fix incorrect handling of formula cells without result caches.
+
+* fix incorrect parsing of invalid XML documents with multiple
+ self-closing root elements.
+
+orcus 0.13.3
+
+* fix the handling of alpha values passed to set_fill_fg_color() and
+ set_fill_bg_color(). A value of 255 means fully opaque whereas a
+ value of 0 means fully transparent.
+
+* fix the solid fill color import, to use the foreground color
+ instead of the background color.
+
+* xlsx
+
+ * import colors to disgonal borders.
+
+ * remove carriage returns from multi-line cell strings.
+
+* xls-xml
+
+ * import border colors.
+
+ * import hidden row and column flags.
+
+orcus 0.13.2
+
+* xls-xml
+
+ * import column width and row height properties.
+
+ * import solid fill colors in cells.
+
+ * import text alignment properties.
+
+ * import cell borders.
+
+* xlsx
+
+ * import justified and distributed text alignment properties.
+
+ * fix exception being thrown when the diagonal element is
+ encountered.
+
+ * import diagonal cell borders.
+
+orcus 0.13.1
+
+* use a more efficient way to set format ranges in spreadsheet
+ model.
+
+* support single quoted strings in the css parser.
+
+orcus 0.13.0
+
+* fix incorrect parsing of XML 1.0 documents that don't include
+ header declarations.
+
+* fix incorrect parsing of XML elements and attributes whose names
+ start with an underscore.
+
+* orcus-csv: add an option to split content into multiple sheets in
+ case it doesn't fit in one sheet.
+
+* add csv dump mode for all spreadsheet document based filter
+ commands.
+
+* orcus-ods: suppress debug outputs unless the debug flag is set.
+
+* orcus-xlsx: correctly import boolean cell values.
+
+* add experimental cmake-based build support, primarily for Windows.
+
+* add initial support for importing select sheet view settings in
+ xlsx and xls-xml.
+
+* add API for directly constructing json document trees.
+
+* support import of cell formats for xls-xml.
+
+* support single-quoted attribute values in the sax xml parser.
+
+* orcus-xml: fix incorrect mapping of XML range data to sheet range
+ when the first column contains one or more empty elements.
+
+* support import of named expressions for xlsx and xls-xml.
+
+* support import of formula cells for xls-xml.
+
+* implement pivot cache import for xlsx.
+
+* fix a number of crashes in the csv parser.
+
+* fix a number of crashes in the json parser.
+
+* fix a number of crashes in the yaml parser.
+
+* integrate orcus into gitlab's continuous integration.
+
+orcus 0.12.1
+
+* fix build when --disable-spreadsheet-model is specified and the
+ ixion headers are not present.
+
+* fix loading of file streams on Windows.
+
+* get the debug flag to work again.
+
+orcus 0.12.0
+
+* handle escaped unicode in the xml parser
+
+* improve odf styles import
+
+* implement threaded xml parser
+
+* implement threaded json parser
+
+orcus 0.11.2
+
+* make it buildable with mdds-1.2.
+
+orcus 0.11.1
+
+* fixed various build issues with MSVC and clang on OSX.
+
+orcus 0.11.0
+
+* remove boost dependency from the public headers.
+
+* implement JSON parser and document storage model.
+
+* implement YAML parser and document storage model.
+
+* add orcus-json.
+
+* add orcus-yaml.
+
+* improve parse error output from the XML parser.
+
+* use enum class in import_style::set_border_style().
+
+* support non-local file import.
+
+orcus 0.1.0
+
+* initial release.