better structure test suite to make clear what's required of CSL implementers #17

bdarcus · 2020-05-27T12:11:38Z

This arises from #13 and #16. What I concluded on the former issue is:

... the best immediate path on the test-suite is to create a separate subdirectory and start to move tests like this to that, and while doing it, add some metadata to these that tag them as such, maybe with some description. We could start with this one.

What I'm contemplating is we need to add three additional parameters or sections to the tests; perhaps:

version (with values like "1.0", "1.1", "1.0-M"; that sort of thing)
description (ideally we write this in spec language)
section (something to group tests together)

My thought is that we do this in such a way that a processing spec can (at least mostly) be auto-assembled without additional work.

If we do this right, the tests themselves become definitive, very clear, and the documentation can be directly tied to them.

While it would take time, this seems the quickest path to addressing the concerns in the linked issue, in iterative ways that fit existing resource constraints.

I realize it's more complicated than this (for example, multiple tests will often describe one larger processing logic), but hopefully those are solvable problems.

Not sure about whether subdirs would be necessary with this approach, but it might wise to at least separate CSL-M?

@citation-style-language/schema-pr-reviewers

bdarcus · 2020-05-27T12:20:48Z

For illustration, this:

>>==== MODE ====>>
citation
<<==== MODE ====<<

... could become:

>>==== VERSION ====>>
1.0
<<==== VERSION ====<<
>>==== DESCRIPTION ====>>
Lorem ipsum dolor sit amet, consectetur adipiscing elit. In cursus ante erat, 
tempus ornare dolor gravida et. Morbi efficitur quam ac varius laoreet. 
Nullam rhoncus est in diam rhoncus ultricies. Nam ligula turpis, consequat 
non dapibus in, vulputate in est. Praesent porttitor ac quam vel convallis. 
<<==== DESCRIPTION ====<<
>>==== SECTION ====>>
locators
<<==== SECTION ====<<
>>==== MODE ====>>
citation
<<==== MODE ====<<

... or maybe we allow a YAML header of something like this (standard, and much more compact):

---
name: some test
mode: citation
versions:
   - 1.0
   - 1.1
section: locators
description: >
            Lorem ipsum dolor sit amet, consectetur adipiscing elit. In cursus ante 
            erat, tempus ornare dolor gravida et. Morbi efficitur quam ac varius laoreet. 
            Nullam rhoncus est in diam rhoncus ultricies. Nam ligula turpis, consequat 
            non dapibus in, vulputate in est. Praesent porttitor ac quam vel convallis.
---

The above would say the same behavior applies across both 1.0 and 1.1.

Perhaps we have a directory structure that would correspond to generated documents (though current file names already have this sort of structure); like:

/tests
     /introduction
     /citations
     /bibliographies
     /names

It does seems like the existing tests have enough info to largely automate this conversion as a first pass (version can be derived from embedded CSL, for example), and then human could add anything else as time permits, or they need to deal with a particular test or test section.

bdarcus · 2020-05-27T13:56:07Z

Hmm .... here's a problem with automating a key part of this; was hoping we could somehow test on the embedded CSL, but it turns out ...

> rg -c 'version="1.0"' processor-tests/humans | wc -l 
869

E.g. it seems that there's no version distinction based on csl vs csl-m?

denismaier · 2020-05-27T15:14:44Z

Maybe there are no csl-m tests in this repo?
At the citeproc-js repo you'll find test fixtures for csl-m, e.g. this one.

bdarcus · 2020-05-27T15:16:59Z

That would be great; if effectively there were only a handful that slipped through?

denismaier · 2020-05-27T15:18:59Z

... or maybe we allow a YAML header of something like this (standard, and much more compact):

Have you had a look a jest-csl?
That is used for testing styles, but the syntax is instructive.

Concering

versions:
   - 1.0
   - 1.1

Maybe something like this:

versions:
   min: 1.0
   max: 1.1

???

bdarcus · 2020-05-27T15:23:11Z

No, but that looks really promising! It uses YAML.

Where the hell is @cormacrelf anyway; seems we could use his help ;-)

Not just because of jest (and because I look forward to a complete citeproc-rs!), but because he's the most recent developer to work his way through a full CSL implementation.

Oh, and sure on your suggestion; I just floated a concrete strawman to get the ball rolling. Might be jest has some better ideas.

denismaier · 2020-05-27T15:44:35Z

That would be great; if effectively there were only a handful that slipped through?

I have to admit that I don't always understand which features require csl-m version attribute. Some csl-m features seem to work also with version="1.0.".

bdarcus · 2020-05-27T15:47:32Z

This test viewer is pretty nice too:

https://cormacrelf.github.io/citeproc-rs-test-viewer/

Am curious about those "ignored" ones.

And then there's this:

https://github.com/cormacrelf/citeproc-rs#running-the-csl-test-suite

The yaml format?

https://github.com/cormacrelf/citeproc-rs/tree/cd73f28945a980e984e73163f6f59e513336c570/crates/citeproc/tests/data/humans

bdarcus · 2020-05-27T15:56:55Z

His "ignore" list, with explanation:

https://github.com/cormacrelf/citeproc-rs/blob/cd73f28945a980e984e73163f6f59e513336c570/crates/citeproc/tests/data/ignore.txt

One possibility immediately, then, is to add his list to our test directory, with our preface?

Until #17 is addressed, this borrows the ignore.txt list from the citeproc-rs project, with explanation, to provide guidance to implementers on which tests they should ignore.

Until #17 is addressed, this borrows the ignore.txt list from the citeproc-rs project, and adds two additional tests identified in #16. This list identifies those tests that rely on undocumented modes in citeproc-js. The contents are simply one filename per line, for easy processing.

bdarcus · 2020-05-29T17:43:06Z

I'm going to close this for now, as I think we've done what we can quickly.

The bigger and broader changes (though it should be feasible to automate a lot of what I suggest I think) will be necessary as we move towards v1.1, but those will have to wait.

Partially addresses #17 and #25, this adds a "VERSION" field to the processor.py script. Syntax for the field value is: [version]:[tag].

fbennett · 2020-06-13T22:10:28Z

To confirm, tests that exercise CSL-M style code have a version of 1.1mlz1, and reside in the citeproc-js repo. Some features of CSL-M may be recognized by that processor in CSL mode, where they don't conflict with CSL behavior and I was too lazy to disable them in the processor code.

bdarcus · 2020-06-13T22:17:31Z

So at this point, @fbennett, the only issues in the files in this repo may be where spec and test need to be aligned?

And are you saying in the cslm test repo, you are also using a "version" variable?

fbennett · 2020-06-13T23:04:53Z

There may still be some things in there that need weeding out. If you come across any culprits, let me know and I'll move them out to the other repo.

On version markers and description, YAML would be good. Also, it might be helpful to both style and processor developers to assign tags to the fixtures, to indicate features that each is meant to exercise, maybe to indicate the "level" of CSL complexity reflected in the test, and to flag things that may want editorial attention (such as fixtures with unnecessarily verbose CSL). I guess something like that would start with a controlled and curated list of tag names.

bdarcus · 2020-06-14T01:52:26Z

YAML would be better, but how sensitive is all this to change?

fbennett · 2020-06-14T02:08:47Z

If a team dug into it, the choices of syntax and content constraints should be made carefully, of course.

bdarcus · 2020-06-14T10:54:15Z

So sounds like maybe I should add tags and description to the PR. I don't feel comfortable modifying beyond that, but it leaves room for others to build on it later (for example, converting to YAML, or whatever).

bdarcus assigned fbennett, rmzelle and adam3smith May 27, 2020

bdarcus added a commit that referenced this issue May 27, 2020

Add ignore.txt list for test suite

7e6cea2

Until #17 is addressed, this borrows the ignore.txt list from the citeproc-rs project, with explanation, to provide guidance to implementers on which tests they should ignore.

bdarcus mentioned this issue May 27, 2020

Add ignore.txt list to test suite #18

Merged

bdarcus added a commit that referenced this issue May 27, 2020

Add ignore.txt list for test suite

ff2671f

Until #17 is addressed, this borrows the ignore.txt list from the citeproc-rs project, with explanation, to provide guidance to implementers on which tests they should ignore.

bdarcus mentioned this issue May 29, 2020

What are the rules for putting spaces around prefixes and suffixes? #13

Open

bdarcus closed this as completed May 29, 2020

bdarcus mentioned this issue Jun 13, 2020

Page ranges with letters #25

Closed

bdarcus added a commit that referenced this issue Jun 13, 2020

Add version field to processor.py

9c89faa

Partially addresses #17 and #25, this adds a "VERSION" field to the processor.py script. Syntax for the field value is: [version]:[tag].

bdarcus mentioned this issue Jun 13, 2020

Add version, tags fields to script, tests #27

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

better structure test suite to make clear what's required of CSL implementers #17

better structure test suite to make clear what's required of CSL implementers #17

bdarcus commented May 27, 2020 •

edited

Loading

bdarcus commented May 27, 2020 •

edited

Loading

bdarcus commented May 27, 2020 •

edited

Loading

denismaier commented May 27, 2020

bdarcus commented May 27, 2020

denismaier commented May 27, 2020

bdarcus commented May 27, 2020 •

edited

Loading

denismaier commented May 27, 2020

bdarcus commented May 27, 2020 •

edited

Loading

bdarcus commented May 27, 2020 •

edited

Loading

bdarcus commented May 29, 2020

fbennett commented Jun 13, 2020 •

edited

Loading

bdarcus commented Jun 13, 2020 •

edited

Loading

fbennett commented Jun 13, 2020 •

edited

Loading

bdarcus commented Jun 14, 2020 via email

fbennett commented Jun 14, 2020

bdarcus commented Jun 14, 2020

better structure test suite to make clear what's required of CSL implementers #17

better structure test suite to make clear what's required of CSL implementers #17

Comments

bdarcus commented May 27, 2020 • edited Loading

bdarcus commented May 27, 2020 • edited Loading

bdarcus commented May 27, 2020 • edited Loading

denismaier commented May 27, 2020

bdarcus commented May 27, 2020

denismaier commented May 27, 2020

bdarcus commented May 27, 2020 • edited Loading

denismaier commented May 27, 2020

bdarcus commented May 27, 2020 • edited Loading

bdarcus commented May 27, 2020 • edited Loading

bdarcus commented May 29, 2020

fbennett commented Jun 13, 2020 • edited Loading

bdarcus commented Jun 13, 2020 • edited Loading

fbennett commented Jun 13, 2020 • edited Loading

bdarcus commented Jun 14, 2020 via email

fbennett commented Jun 14, 2020

bdarcus commented Jun 14, 2020

bdarcus commented May 27, 2020 •

edited

Loading

bdarcus commented May 27, 2020 •

edited

Loading

bdarcus commented May 27, 2020 •

edited

Loading

bdarcus commented May 27, 2020 •

edited

Loading

bdarcus commented May 27, 2020 •

edited

Loading

bdarcus commented May 27, 2020 •

edited

Loading

fbennett commented Jun 13, 2020 •

edited

Loading

bdarcus commented Jun 13, 2020 •

edited

Loading

fbennett commented Jun 13, 2020 •

edited

Loading