A feeble attempt on writing a browser and learning rust.
Note: code in the main branch is currently not stable and might not even compile.
_ | | __ _ ___ ___ _ _| |__ / _` |/ _ \/ __| | | | '_ \ | (_| | (_) \__ \ |_| | |_) | \__, |\___/|___/\__,_|_.__/ __/ | The Gateway to |___/ Optimized Searching and Unlimited Browsing
This repository is part of the GoSub browser project. Currently there is only a single component/repository (this one), but the idea will be that there are many other components that as a whole make up a full-fledged browser. Each of the components can probably function as something standalone (ie: html5 parser, css parser, etc).
In the future, this component (html5 parser) will receive through an API a stream of bytes and will output a stream of events. The events will be consumed by the next component and so on, until we can display something in a window/user agent. This could very well be a text-mode browser, but the idea is to have a graphical browser.
This is a work in progress. The current status is that the parser can parse a few html5 documents, but it is far from ready. The main goal is to be able to parse correctly all the tests in the html5lib-tests repository (https://github.com/html5lib/html5lib-tests). As soon as we can do this, we can try and see if we can generate a DOM tree and then we can start thinking about the next component (css parser).
This project uses cargo (https://doc.rust-lang.org/cargo/). To build the project, simply run:
This will create the following binaries and libs:
|gosub-engine||lib||The actual html5 parser/tokenizer|
|gosub-browser||bin||Dummy browser (see below)|
|parser_test||bin||A test suite for the parser|
|tokenizer_test||bin||A test suite for the tokenizer|
This is the actual html5 parser/tokenizer. It is a library that can be used by other projects. It is not a standalone project. It is used by the gosub-browser project.
This is a dummy browser. It is not a real browser, it is just a test project to see if the parser/tokenizer is working and tries to parse the given url on the command line:
$ gosub-browser https://www.google.com
This is a test suite for the parser. It is not a standalone project. It is used by the gosub-engine project. You need to specify the directory to the html5lib-test in order to run, or it will use the default one (./html5lib-tests).
$ parser_test /path/to/html5lib-tests