-
Notifications
You must be signed in to change notification settings - Fork 177
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fixing cpe data split for cpe fields containing colon #1142
base: master
Are you sure you want to change the base?
Conversation
This doesn't seem to handle escaped backslash correctly:
We already have proper CPE splitting code in vulnupdater: |
Fancy doing this? Also should be nice to have an example of misparsing to make sure it's fixed afterwards. I guess the issue manifests as some incorrect problem? |
You are right, my change fixes the escaped colon but ignores the escaped backslash, and using a function like mentioned above would be more suitable. I might be inclined to try implementing that function, although it might require some further changes in the gentoo parser, as the program uses fixed fields for the cpe information:
which works for the currently used CPE2.2 format but would fail for the CPE2.3 format. It would probably be a good idea to extend the parser to support both CPE URI formats. Although, going forward, I would also include support for the WFN format, which makes field assignments much easier and would allow the definition of cpe information in three ways:
|
It depends on which format Gentoo (and other repos) currently uses. As far as I remember, we only need CPE string parsing for Gentoo (other repos with CPE info available provide it as separate fields), and Gentoo only uses 2.2 format. 2.3 format can be supported with the same splitting code with just an extra condition which determines the format by the starting components. I would not over-engineer by adding support for wfn which is not used anywhere, but if we need it at some point, it would make sense to switch to third party module such as cpe or cpe_utils. |
I can't speak for Gentoo, but at Liguros we currently provide the cpe information in the 2.2 format, which is the format we picked up from Gentoo. Switching over to 2.3 format would be a matter of a few minutes, as the information is generated automatically in our update pipeline. Thus we don't mind if we deliver the information in 2.2 or 2.3 format; even changing it to a more suitable format would not be out of the question. For now I would try to change the gentoo parser, adding the splitting code, with the possibility to parse 2.2 or 2.3 format. |
OK, it seems that I finally satisfied the checks :) I have added cpe.py with the parse function. gentoo.py is now using that function and should now support cpe information in 2.2 and 2.3 format. A test file with some test cases was added as well. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Won't it be more convenient to have cpe_parse
do the format distinguishing?
I'd make it return a dataclass of parsed cpe fields, which can be constructed out of slice of fields list, e.g.
@dataclass
class CPE:
vendor: str = '*'
product: str = '*'
...
def cpe_parse(cpe_str: str) -> CPE:
...
if res[1] == '2.3':
return CPE(res[3:])
else:
return CPE(res[2:])
OK, with my little knowledge of python, I think I was able to add the parse as part of a data class. Seems to work for the parser, although I am not so sure about the test cases. |
Hmm, the check currently fails in dependencies with:
Could you have a look at it? |
ba3fcf1
to
0e8c7b8
Compare
e868b44
to
b0bf5db
Compare
The fields for projects whose vendor and/or project fields contain a colon, like e.g.: perl:archive-tar or perl:convert-asn1, are not correctly split from the given URI binding.
Currently fields with escaped colons like archive::tar_project are also split at the colon. With this pull request, the escaped colons are ignored and the URI binding is only split at unescaped colons.