Forum Discussion
In Public Beta: Source Content Image Extraction
Thanks so much for sharing your feedback, TerryPurtell and ISa, that’s great to hear!
It’s really encouraging to see the feature working smoothly and already adding value, especially with how much time it can save when building courses from existing content.
- JenniferSweeney, 4 days ago, Community Member
EricSantos, hoping you can help. Image Extraction is turned on, and I can get a PDF source file to work, but if I try to upload images saved in any .DOC or .PPT file, I get an error message that says 'text can't be extracted'. See screenshot. I thought that was odd, since the Articulate instructions clearly say you can upload .DOC or .PPT file types.
Additionally, when I compare inserting an image from the PDF source I uploaded with inserting the same image as an SVG or JPG file, the image quality is not the same. What it's pulling in from my PDF source file is blurry, whereas the SVG or JPG I upload from my machine is clear. Any tips on getting the images pulled from the source file to be clearer?
- EricSantos, 2 days ago, Staff
Thanks for sharing those details, JenniferSweeney. That helps paint a clearer picture of what you’re seeing.
For the error with DOC and PPT files, it's helpful to know that image extraction and text extraction are handled as separate processes, so one can work while the other doesn't. When the system isn't able to read or extract text as expected, you may see that "text can't be extracted" message even if the file type itself is supported.
On the image quality side, what you’re noticing can happen depending on how the images are stored in the source file. Extracted images may come through at a lower resolution compared to the original files, which is why directly uploaded SVG or JPG images can appear sharper.
Since this feature is still in Public Beta, the team is actively working on improving extraction behavior and exploring ways to preserve or enhance image quality over time.