textract Not imported

Extracts text from files of various types, including html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf, text/*, and various OpenOffice formats.

This package is not yet in the greenflagged registry. Import it to start the review process.

textract@2.5.0
Latest Version: 2.5.0
License: MIT
Published Versions: 46
Dependencies: 14

Maintainers

dbashford

Keywords

textract, extract, html, csv, text, pdf, docx, doc, xls, xlsx, png, jpg, gif, rtf, dxf, pptx, html, markdown, xml, odt, ott, xlsb, xlsm, xltx, ods, ots, potx, odg, otg, epub

Dependencies

Package Constraint
mime 2.2.0
pdf-text-extract 1.3.1
xpath 0.0.23
xmldom 0.1.27
j 0.4.3
cheerio 1.0.0-rc.2
marked 0.6.2
meow 3.7.0
got 5.7.1
html-entities 1.2.0
iconv-lite 0.4.15
jschardet 1.4.1
yauzl 2.7.0
epub2 1.3.4

Published Versions

Version
2.5.0
2.4.0
2.3.0
2.2.0
2.1.2
2.1.1
2.1.0
2.0.0
1.2.1
1.2.0
1.1.2
1.1.1
1.1.0
1.0.4
1.0.3
1.0.2
1.0.1
1.0.0
0.20.0
0.19.0
0.18.0
0.17.0
0.16.0
0.15.0
0.14.0
0.13.2
0.13.1
0.13.0
0.12.0
0.11.2
0.11.1
0.11.0
0.10.1
0.10.0
0.9.0
0.8.0
0.7.1
0.7.0
0.6.0
0.5.0
0.4.1
0.4.0
0.3.0
0.2.0
0.1.0
0.0.1