Details
Bug
Status: Closed
Major
Resolution: Fixed
4.3.3
None
Description
The bug we reported some time ago about null hosts in redirects seems to have regressed, although the old problem was with a "null" host name and the new problem is with a "blank" one:
2014-04-20 04:20:09,169 19319369 ERROR [FetchingThread-197] i.u.d.l.b.f.FetchingThread - Unexpected exception
java.lang.IllegalArgumentException: Host name may not be blank
at org.apache.http.util.Args.notBlank(Args.java:68) ~[httpcore.jar:4.3.2]
at org.apache.http.HttpHost.<init>(HttpHost.java:81) ~[httpcore.jar:4.3.2]
at org.apache.http.client.utils.URIUtils.extractHost(URIUtils.java:370) ~[httpclient.jar:4.3.3]
at org.apache.http.impl.execchain.RedirectExec.execute(RedirectExec.java:132) ~[httpclient.jar:4.3.3]
at org.apache.http.impl.client.InternalHttpClient.doExecute(InternalHttpClient.java:186) ~[httpclient.jar:4.3.3]
at org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:72) ~[httpclient.jar:4.3.3]
at org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:214) ~[httpclient.jar:4.3.3]
at org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:185) ~[httpclient.jar:4.3.3]
at it.unimi.di.law.bubing.util.FetchData.fetch(FetchData.java:322) ~[bubing-0.9.3.jar:na]
This is caused by this site:
> wget --max-redirect=0 http://www.thegamersedge.co.uk/robots.txt
--2014-04-20 20:47:43--  http://www.thegamersedge.co.uk/robots.txt
Resolving www.thegamersedge.co.uk (www.thegamersedge.co.uk)... 72.1.201.156, 72.1.201.152
Connecting to www.thegamersedge.co.uk (www.thegamersedge.co.uk)|72.1.201.156|:80... connected.
HTTP request sent, awaiting response... 302 Moved Temporarily
Location: http://robots.txt [following]
0 redirections exceeded.
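Until the client handles such redirects gracefully, a caller could check the Location value itself before following it. The following is only a minimal sketch of that idea using java.net.URI from the JDK. The names RedirectHostCheck and hasUsableHost are hypothetical, are not part of HttpClient, and do not represent the fix applied in the library.

```java
import java.net.URI;
import java.net.URISyntaxException;

// Hypothetical helper (not part of HttpClient): decides whether a redirect
// Location value carries a host that is safe to hand to an HTTP client.
public final class RedirectHostCheck {

    private RedirectHostCheck() {}

    /**
     * Returns true only if the Location value parses as an absolute URI
     * whose host component is present and not blank.
     */
    public static boolean hasUsableHost(String location) {
        if (location == null) {
            return false;
        }
        final URI uri;
        try {
            uri = new URI(location.trim());
        } catch (URISyntaxException e) {
            // Malformed Location header: treat it as unusable.
            return false;
        }
        final String host = uri.getHost();
        return uri.isAbsolute() && host != null && !host.trim().isEmpty();
    }

    public static void main(String[] args) {
        // Prints the verdict for each Location value given on the command line.
        for (String location : args) {
            System.out.println(location + " -> " + hasUsableHost(location));
        }
    }
}
```

A fetching thread could call hasUsableHost on the Location header and skip the redirect when it returns false, rather than relying on catching the IllegalArgumentException shown in the stack trace.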