GCloud Blog scraper
Scrape the content of a post from GCloud Blog. This will return a Zip file containing the following
- body.txt - This is what you paste in the WP post. This has all the anchor/links updated with target='_blank'
- footer.txt - The author section, with anchor/link to the source
- <Images> - Images that have been formatted to JPG and with formatted file names ready for upload to WP
The following are disclaimers or things for consideration when using the scraper
- YouTube links are not included in the body.txt. You need to add this manually
- The images included in the body.txt still refers to the source page. The generated images must be uploaded then replace these images at the WP.
- Some special characters may appear in the page, once content of the body.txt have been copied to the WP post body. This needs to be updated manually. This rarely occurs. Examples are special double quotes, single quote and hyphen that are of different kind.
SD Times
Scrape the content of a post from SD Times. This will return a Zip file containing the following
- body.txt - This is what you paste in the WP post. This has all the anchor/links updated with target='_blank'
- footer.txt - The author section, with anchor/link to the source
- <Images> - Images that have been formatted to JPG and with formatted file names ready for upload to WP
The following are disclaimers or things for consideration when using the scraper
- YouTube links are not included in the body.txt. You need to add this manually
- The images included in the body.txt still refers to the source page. The generated images must be uploaded then replace these images at the WP.
Tech Republic
Scrape the content of a post from Tech Republic. This will return a Zip file containing the following
- body.txt - This is what you paste in the WP post. This has all the anchor/links updated with target='_blank'
- takeaway.txt - This is an introduction sentence/paragraph
- footer.txt - The author section, with anchor/link to the source
- <Images> - Images that have been formatted to JPG and with formatted file names ready for upload to WP
The following are disclaimers or things for consideration when using the scraper
- YouTube links are not included in the body.txt. You need to add this manually
- The images included in the body.txt still refers to the source page. The generated images must be uploaded then replace these images at the WP.
CNCF Blog
Scrape the content of a post from Cloud Native Computing Foundation. This will return a Zip file containing the following
- body.txt - This is what you paste in the WP post. This has all the anchor/links updated with target='_blank'
- footer.txt - The author section, with anchor/link to the source
- <Images> - Images that have been formatted to JPG and with formatted file names ready for upload to WP
The following are disclaimers or things for consideration when using the scraper
- YouTube links are not included in the body.txt. You need to add this manually
- The images included in the body.txt still refers to the source page. The generated images must be uploaded then replace these images at the WP.