Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

not all Get operations taken into account #5

Open
davidoesch opened this issue Jan 11, 2016 · 3 comments
Open

not all Get operations taken into account #5

davidoesch opened this issue Jan 11, 2016 · 3 comments
Assignees

Comments

@davidoesch
Copy link

Hi
IMHO you have to take into account all operations flag, not only "WEBSITE.GET."
missing
"REST.GET.OBJECT"
"SOAP.GET.“

Otherwise you will miss m2m requests..

@metaodi
Copy link
Member

metaodi commented Jan 11, 2016

@davidoesch I guess you're refering to the parse code, which currently only takes REST.GET.OBJECT into account.

I'm not really an expert, I guess the reason for this is to filter out "true" downloads of a file. Does WEBSITE.GET and SOAP.GET have a similar meaning?

@davidoesch
Copy link
Author

according to http://docs.aws.amazon.com/AmazonS3/latest/dev/LogFormat.html

a SOAP or a REST based download is IMHO a download as well... and we have some datasets on data.geo.admin.ch which are only consumed by REST

@metaodi metaodi self-assigned this Jan 11, 2016
@davidoesch
Copy link
Author

another solution is to download all the log files, merge/cat them into one file and then do some AWK magic
grep -E "REST.GET.OBJECT|WEBSITE.GET|SOAP.GET" YOURLOG.txt | awk '$13 ~ /200/ {array_bytes[$5 " " $9] +=$15;array_counter[$5 " " $9]++;} END {for (i in array_bytes) {printf "%s %.0f %.0f\n",i,array_counter[i],array_bytes[i]}}' > YOURRESULTS.log

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants