Import text from internet location¶
Goal¶
Import text content located at one or more URLs for further processing with Orange Textable.
Procedure¶
Single URL¶
- Create an instance of URLs on the canvas.
- Open its interface by double-clicking on the created instance.
- Make sure the Advanced settings checkbox is not selected.
- In the URL field, type the URL whose content you want to import
(including the
http://
prefix). - In the Encoding drop-down menu, select the encoding that corresponds to this URL.
- Click the Send button (or make sure the Send automatically checkbox is selected).
- A segmentation covering the URL’s content is then available on the URLs instance’s output connections; to display or export it, see Cookbook: Text output.
Multiple URLs¶
- Create an instance of URLs on the canvas.
- Open its interface by double-clicking on the created instance.
- Make sure the Advanced settings checkbox is selected.
- If needed, empty the list of imported URLs by clicking the Clear all button.
- In the URL(s) field, enter the URLs you want to import (including the
http://
prefix), separated by the string ” / ” (space + slash + space); make sure they all have the same encoding (you will be able to add URLs that have other encodings later). - In the Encoding drop-down menu, select the encoding that corresponds to the set of selected URLs.
- Click the Add button to add the set of selected URLs to the list of imported URLs.
- Repeat steps 5 to 7 for adding URLs in other encoding(s).
- Click the Send button (or make sure the Send automatically checkbox is selected).
- A segmentation containing a segment covering each imported URL’s content is then available on the URLs instance’s output connections; to display or export it, see Cookbook: Text output.