When matching patterns, knowing which regular expression has matched the token, when can know its type. We can then store it So as to do it, we first need to cut the tokens already retrieved (HTML tags and words) using separators in order to obtain the final tokens (smaller et more precise). We (...) >continue
Regular expressions are precise and powerful tools that enable to match patterns in text. They are going to be useful there, to erase the code we don’t want (commentaries, java scripts, style sheets etc.), and to detect the tokens we want to retrieve and store.
We want to retrieve HTML (...) >continue