Sisyphe-go is a golang command line application for recursive analysis of directories and files of scientific publishing corpus
example | 1 year ago | ||
kibanatemplates | 1 year ago | ||
nginx.conf.d | 1 year ago | ||
.dockerignore | 1 year ago | ||
.gitignore | 1 year ago | ||
Dockerfile | 1 year ago | ||
README.md | 1 year ago | ||
docker-compose.yml | 1 year ago | ||
go.mod | 1 year ago | ||
go.sum | 1 year ago | ||
indexCorpus.sh | 1 year ago | ||
main.go | 1 year ago | ||
pdf.go | 1 year ago | ||
pdf_test.go | 1 year ago | ||
struct.go | 1 year ago | ||
util.go | 1 year ago | ||
xml.go | 1 year ago | ||
xml_test.go | 1 year ago |
Sisyphe is a generic Golang recursive folder analyser terminal application
Tested with Golang 1.18
Works on Linux/OSX/Windows
Mount a corpus folder and :
docker-compose up -d docker exec -it sisyphe_go_go_1 go run . -n corpusName -c corpuspath -o outputpath
go build .
go run . --help
Will output help
--help Output usage -c Configuration folder path -n Corpus name (default "test") -o Output directory where results are written -p Corpus path -w Counting word on pdf -noindex Disable indexation after process
Just start Sisyphe on a folder with any files in it.
go run . ~/Documents/customfolder/corpus -n corpusname -o outputpath
go run . ~/Documents/customfolder/corpus -n corpusname -c ~/Documents/customfolder/corpusResources -o outputpath
Sisyphe is now working in background with all your computer thread. Just take a coffee and wait , it will prevent you when it's done :)
The results of sisyphe are present @ sisyphe/out/{timestamp}-corpusName/
(errors,info,duration..)
Just run go test
For cover go test -cover
pdftotext
and pdfinfo
)xmlstarlet
and xmllint
)