Package org.apache.nutch.util
Class TableUtil
- java.lang.Object
-
- org.apache.nutch.util.TableUtil
-
public class TableUtil extends Object
-
-
Field Summary
Fields Modifier and Type Field Description static ByteBufferYES_VAL
-
Constructor Summary
Constructors Constructor Description TableUtil()
-
Method Summary
All Methods Static Methods Concrete Methods Modifier and Type Method Description static StringgetReversedHost(String reversedUrl)Given a reversed url, returns the reversed host E.g "com.foo.bar:http:8983/to/index.html?a=b" -> "com.foo.bar"static StringreverseHost(String hostName)static StringreverseUrl(String urlString)Reverses a url's domain.static StringreverseUrl(URL url)Reverses a url's domain.static StringtoString(CharSequence utf8)Convert given Utf8 instance to String and and cleans out any offending "�" from the String.static StringunreverseHost(String reversedHostName)static StringunreverseUrl(String reversedUrl)
-
-
-
Field Detail
-
YES_VAL
public static final ByteBuffer YES_VAL
-
-
Method Detail
-
reverseUrl
public static String reverseUrl(String urlString) throws MalformedURLException
Reverses a url's domain. This form is better for storing in hbase. Because scans within the same domain are faster.E.g. "http://bar.foo.com:8983/to/index.html?a=b" becomes "com.foo.bar:8983:http/to/index.html?a=b".
- Parameters:
urlString- url to be reversed- Returns:
- Reversed url
- Throws:
MalformedURLException- if the input urlString is malformed
-
reverseUrl
public static String reverseUrl(URL url)
Reverses a url's domain. This form is better for storing in hbase. Because scans within the same domain are faster.E.g. "http://bar.foo.com:8983/to/index.html?a=b" becomes "com.foo.bar:http:8983/to/index.html?a=b".
- Parameters:
url- url to be reversed- Returns:
- Reversed url
-
getReversedHost
public static String getReversedHost(String reversedUrl)
Given a reversed url, returns the reversed host E.g "com.foo.bar:http:8983/to/index.html?a=b" -> "com.foo.bar"- Parameters:
reversedUrl- Reversed url- Returns:
- Reversed host
-
toString
public static String toString(CharSequence utf8)
Convert given Utf8 instance to String and and cleans out any offending "�" from the String.- Parameters:
utf8- Utf8 object- Returns:
- string-ifed Utf8 object or null if Utf8 instance is null
-
-