Parser Settings

Parse your articles automatically. Set up specific rules for how BotTalk should parse your articles.

For the BotTalk Parser to work correctly, you must first configure it.

Parser Settings - BotTalk Audio CMS
  1. Activate the BotTalk parser by checking the checkbox: Enable automatic parsing of articles.

  2. Enter your website URL.

  3. Set the starting date. BotTalk Parser will convert all articles published after this date into speech. The parser will ignore all articles published before this date.

  4. Update speech on article text change: a news article gets updated on average 2.8 times. Check this checkbox if you want to keep your written articles in sync with the audio editions.
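The rules in steps 3 and 4 can be sketched in a few lines of Python. This is only an illustration, not BotTalk's actual implementation: the function names are hypothetical, the handling of articles published exactly on the starting date is an assumption, and comparing text fingerprints is just one plausible way to detect an article update.

```python
import hashlib
from datetime import date


def should_convert(published: date, start_date: date) -> bool:
    """Step 3: convert only articles published after the starting date.

    Assumption: an article published exactly on the starting date is not
    converted. The page does not say how the boundary day is treated.
    """
    return published > start_date


def text_fingerprint(text: str) -> str:
    """Return a stable SHA-256 fingerprint of the article text."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def needs_new_audio(stored_fingerprint: str, current_text: str) -> bool:
    """Step 4: the audio is out of date when the article text has changed."""
    return text_fingerprint(current_text) != stored_fingerprint
```

With a starting date of 2024-05-01, an article published on 2024-05-02 would be converted and one published on 2024-04-30 would be ignored.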


Article Publication Date Format

BotTalk needs an article publication date to correctly sort, deduplicate, and process content. The parser can usually detect this automatically, but you can also provide it manually when needed.

How BotTalk determines the publication date

1) Automatic detection (preferred)

The BotTalk Parser first tries to extract the article’s publication date from JSON-LD embedded in the page (typically a <script type="application/ld+json">…</script> block).

Common JSON-LD fields that may contain a publication date include:

  • datePublished

  • dateCreated

  • dateModified (used only if publication date is not available, depending on site structure)

If JSON-LD is present and contains a valid date, no additional configuration is required.
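To make the automatic detection more concrete, here is a minimal Python sketch of reading a publication date from JSON-LD. It is not BotTalk's parser: the class and function names are hypothetical, the priority order of the three fields is an assumption based on the list above, and the sketch neither validates the date string nor looks inside nested structures such as @graph.

```python
import json
from html.parser import HTMLParser

# Assumed priority: publication date first, modification date last.
DATE_FIELDS = ("datePublished", "dateCreated", "dateModified")


class JsonLdCollector(HTMLParser):
    """Collects the raw text of every <script type="application/ld+json"> block."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self.blocks.append("".join(self._buffer))
            self._in_jsonld = False


def find_publication_date(html: str):
    """Return the first non-empty date string found in JSON-LD, or None."""
    collector = JsonLdCollector()
    collector.feed(html)
    for raw in collector.blocks:
        try:
            data = json.loads(raw)
        except json.JSONDecodeError:
            continue  # skip malformed JSON-LD blocks
        items = data if isinstance(data, list) else [data]
        for item in items:
            if not isinstance(item, dict):
                continue
            for field in DATE_FIELDS:
                value = item.get(field)
                if isinstance(value, str) and value.strip():
                    return value
    return None
```

A page whose JSON-LD contains both datePublished and dateModified yields the datePublished value; a page without any JSON-LD yields None, which corresponds to the manual fallback case.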

2) Manual date (fallback)

If the page does not contain JSON-LD (or it’s missing a usable date), you must provide the article publication date manually via extractor rules.

Use a manual date only when:

  • There is no JSON-LD on the page, or

  • JSON-LD exists but does not include a valid datePublished (or equivalent), or

  • The JSON-LD date is incorrect and must be overridden.

It's recommended to use the <time> HTML tag, but other tags are supported as well.
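As an illustration of why the <time> tag is convenient, the Python sketch below reads the machine-readable datetime attribute from a sample article. The sample markup, the class name, and the function name are assumptions made for this example. The syntax of BotTalk's extractor rules is not shown here.

```python
from html.parser import HTMLParser


class TimeTagFinder(HTMLParser):
    """Remembers the datetime attribute of the first <time> tag that has one."""

    def __init__(self):
        super().__init__()
        self.datetime_value = None

    def handle_starttag(self, tag, attrs):
        if tag == "time" and self.datetime_value is None:
            self.datetime_value = dict(attrs).get("datetime")


def date_from_time_tag(html: str):
    """Return the datetime attribute of a <time> tag, or None if there is none."""
    finder = TimeTagFinder()
    finder.feed(html)
    return finder.datetime_value


# Hypothetical article markup with a machine-readable publication date.
sample = (
    '<article><h1>Headline</h1>'
    '<time datetime="2024-05-01T08:30:00+02:00">May 1, 2024</time>'
    '</article>'
)
print(date_from_time_tag(sample))
```

The datetime attribute carries an unambiguous timestamp even when the visible text ("May 1, 2024") is localized or abbreviated.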


Whitelisting BotTalk Parser in Firewall

Sometimes BotTalk Parser can get blocked by your firewall settings.

You will see an error message with a 403 HTTP status code in your browser console, stating that BotTalk Parser cannot access your website.

To fix this, make sure to whitelist both the BotTalk Parser IP address and its HTTP headers.

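The idea behind such a whitelist rule can be sketched in Python as follows. Everything specific in this example is a placeholder: the IP address comes from a range reserved for documentation, the header name and value are hypothetical, and requiring both checks to pass is an assumption. Use the actual IP address and headers that BotTalk provides, and configure them in your own firewall's rule syntax.

```python
from ipaddress import ip_address

# Placeholders only. Replace them with the values BotTalk provides.
ALLOWED_IPS = {ip_address("203.0.113.10")}  # documentation-range address, not BotTalk's
REQUIRED_HEADERS = {"X-Example-Parser": "example-value"}  # hypothetical header


def is_allowed(remote_ip: str, headers: dict) -> bool:
    """Allow a request only if both the source IP and the headers match."""
    if ip_address(remote_ip) not in ALLOWED_IPS:
        return False
    # HTTP header names are case-insensitive, so compare them in lowercase.
    normalized = {name.lower(): value for name, value in headers.items()}
    return all(
        normalized.get(name.lower()) == value
        for name, value in REQUIRED_HEADERS.items()
    )
```

A request that passes this check would be let through instead of receiving the 403 response.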

Limiting TTS Costs

To avoid the risk of high TTS costs, you can set a daily character limit. The default limit is 100,000 characters.


You can view the total number of characters used in the "Cost of Audification" statistics.

To specify the limit, check the "Daily Character Limit" checkbox. The "Specify Limit" field then appears, where you can enter your custom limit.
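The effect of a daily character limit can be sketched with a small Python counter. This is an illustration under stated assumptions rather than BotTalk's implementation: the class name is hypothetical, the counter is assumed to reset at the start of each calendar day, and an article that would exceed the limit is assumed to be skipped entirely.

```python
from datetime import date

DEFAULT_DAILY_LIMIT = 100_000  # default limit mentioned in this section


class DailyCharacterBudget:
    """Tracks how many characters have been sent to TTS on the current day."""

    def __init__(self, limit: int = DEFAULT_DAILY_LIMIT):
        self.limit = limit
        self._day = None
        self._used = 0

    @property
    def used(self) -> int:
        return self._used

    def try_consume(self, text: str, today: date) -> bool:
        """Reserve len(text) characters; return False if the limit would be exceeded."""
        if today != self._day:
            # Assumption: the counter resets when a new calendar day starts.
            self._day = today
            self._used = 0
        if self._used + len(text) > self.limit:
            return False
        self._used += len(text)
        return True
```

With the default limit, twenty articles of 5,000 characters each fit into one day, and the twenty-first would be rejected until the counter resets.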

Set up custom daily character limit
