OpenRefine


OpenRefine is a tool for processing large volumes of data (formerly known as Google Refine).

Info

Java

OpenRefine (see below) requires Java 8 through 15; it does not work with Java 16 or newer.
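Before installing, it can help to check which Java is on the PATH. The following is a minimal sketch, not part of OpenRefine: the function names are hypothetical, and it assumes that `java -version` prints a line containing `version "..."`, as OpenJDK builds do.

```shell
#!/bin/sh
# Sketch: check whether a Java version string falls in the 8..15 range
# stated above. All function names here are illustrative.

# Print the major version for strings like "1.8.0_292" (-> 8) or "11.0.11" (-> 11).
java_major() {
    v="$1"
    case "$v" in
        1.*) v="${v#1.}" ;;        # legacy scheme used up to Java 8
    esac
    printf '%s\n' "${v%%[!0-9]*}"  # keep only the leading digits
}

# Succeed (exit status 0) when the major version is between 8 and 15.
java_supported() {
    major="$(java_major "$1")"
    [ -n "$major" ] && [ "$major" -ge 8 ] && [ "$major" -le 15 ]
}

# Print the version string reported by the java binary found on PATH.
# Assumes the output contains: version "x.y.z"
installed_java_version() {
    java -version 2>&1 | sed -n 's/.*version "\([^"]*\)".*/\1/p' | head -n 1
}
```

A possible use: `java_supported "$(installed_java_version)" || echo "Unsupported Java version"`.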

Installation on Linux

 tar xzf openrefine-linux-3.5.0.tar.gz
 ./refine
Using refine.ini for configuration
You have 7717M of free memory.
Your current configuration is set to use 1400M of memory.
OpenRefine can run better when given more memory. Read our FAQ on how to allocate more memory here:
https://github.com/OpenRefine/OpenRefine/wiki/FAQ-Allocate-More-Memory
/usr/bin/java -cp server/classes:server/target/lib/* -Xms1400M -Xmx1400M -Drefine.memory=1400M -Drefine.max_form_content_size=1048576 -Drefine.verbosity=info -Dpython.path=main/webapp/WEB-INF/lib/jython -Dpython.cachedir=/home/petr/.local/share/google/refine/cachedir -Drefine.webapp=main/webapp -Drefine.port=3333 -Drefine.interface=127.0.0.1 -Drefine.host=127.0.0.1 com.google.refine.Refine
Starting OpenRefine at 'http://127.0.0.1:3333/'

log4j:WARN No appenders could be found for logger (org.eclipse.jetty.util.log).
log4j:WARN Please initialize the log4j system properly.
log4j:WARN See http://logging.apache.org/log4j/1.2/faq.html#noconfig for more info.
Exception in thread "main" java.lang.UnsatisfiedLinkError: Can't load library: /usr/lib/jvm/java-11-openjdk-amd64/lib/libawt_xawt.so
        at java.base/java.lang.ClassLoader.loadLibrary(ClassLoader.java:2630)
        at java.base/java.lang.Runtime.load0(Runtime.java:768)
        at java.base/java.lang.System.load(System.java:1837)
        at java.base/java.lang.ClassLoader$NativeLibrary.load0(Native Method)
        at java.base/java.lang.ClassLoader$NativeLibrary.load(ClassLoader.java:2442)
        at java.base/java.lang.ClassLoader$NativeLibrary.loadLibrary(ClassLoader.java:2498)
        at java.base/java.lang.ClassLoader.loadLibrary0(ClassLoader.java:2694)
        at java.base/java.lang.ClassLoader.loadLibrary(ClassLoader.java:2648)
        at java.base/java.lang.Runtime.loadLibrary0(Runtime.java:830)
        at java.base/java.lang.System.loadLibrary(System.java:1873)
        at java.desktop/java.awt.Toolkit$3.run(Toolkit.java:1399)
        at java.desktop/java.awt.Toolkit$3.run(Toolkit.java:1397)
        at java.base/java.security.AccessController.doPrivileged(Native Method)
        at java.desktop/java.awt.Toolkit.loadLibraries(Toolkit.java:1396)
        at java.desktop/java.awt.Toolkit.<clinit>(Toolkit.java:1429)
        at java.desktop/java.awt.Component.<clinit>(Component.java:621)
        at com.google.refine.Refine.init(Refine.java:124)
        at com.google.refine.Refine.main(Refine.java:110)

Despite the exception, we can see that a local web server has started at: http://127.0.0.1:3333/

Reportedly it works best in a WebKit-based browser, for example:

We try it, and the page opens.

Still, we go back to the error above and search the web for: Can't load library: /usr/lib/jvm/java-11-openjdk-amd64/lib/libawt_xawt.so

ldd /usr/lib/jvm/java-11-openjdk-amd64/lib/libawt_xawt.so
  ldd: /usr/lib/jvm/java-11-openjdk-amd64/lib/libawt_xawt.so: No such file or directory

The library file does not exist. On Ubuntu 20.04, installing the JDK package provides it:

sudo apt install openjdk-11-jdk
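The check done with ldd above can be wrapped in a small helper. This is only a sketch: the function names are made up for illustration, and the suggested package is the one used on this page for Ubuntu 20.04, which may differ on other distributions.

```shell
#!/bin/sh
# Sketch: verify that a JVM directory contains the AWT X11 library
# whose absence caused the UnsatisfiedLinkError shown above.

# Succeed when the library exists under the given JVM directory.
has_awt_xawt() {
    [ -f "$1/lib/libawt_xawt.so" ]
}

# Report the state; on failure, suggest the package installed on this page.
check_awt() {
    jvm_dir="$1"
    if has_awt_xawt "$jvm_dir"; then
        echo "ok: $jvm_dir/lib/libawt_xawt.so found"
    else
        echo "missing: $jvm_dir/lib/libawt_xawt.so (try: sudo apt install openjdk-11-jdk)"
        return 1
    fi
}
```

For the setup described here, it would be called as `check_awt /usr/lib/jvm/java-11-openjdk-amd64`.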

Stop the server with Ctrl-C

and start it again:

./refine
Using refine.ini for configuration
You have 7717M of free memory.
Your current configuration is set to use 1400M of memory.
OpenRefine can run better when given more memory. Read our FAQ on how to allocate more memory here:
https://github.com/OpenRefine/OpenRefine/wiki/FAQ-Allocate-More-Memory
/usr/bin/java -cp server/classes:server/target/lib/* -Xms1400M -Xmx1400M -Drefine.memory=1400M -Drefine.max_form_content_size=1048576 -Drefine.verbosity=info -Dpython.path=main/webapp/WEB-INF/lib/jython -Dpython.cachedir=/home/petr/.local/share/google/refine/cachedir -Drefine.webapp=main/webapp -Drefine.port=3333 -Drefine.interface=127.0.0.1 -Drefine.host=127.0.0.1 com.google.refine.Refine
Starting OpenRefine at 'http://127.0.0.1:3333/'

log4j:WARN No appenders could be found for logger (org.eclipse.jetty.util.log).
log4j:WARN Please initialize the log4j system properly.
log4j:WARN See http://logging.apache.org/log4j/1.2/faq.html#noconfig for more info.
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/nvme0n1p7/DATAp7/mir/inst/OpenRefine/openrefine-3.5.0/webapp/WEB-INF/lib/slf4j-log4j12-1.7.30.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/nvme0n1p7/DATAp7/mir/inst/OpenRefine/openrefine-3.5.0/server/target/lib/slf4j-log4j12-1.7.30.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
23:56:20.388 [                   refine] Starting OpenRefine 3.5.0 [d4209a2]... (0ms)
23:56:20.389 [                   refine] initializing FileProjectManager with dir (1ms)
23:56:20.389 [                   refine] /home/petr/.local/share/openrefine (0ms)
WARNING: An illegal reflective access operation has occurred
WARNING: Illegal reflective access by org.mozilla.javascript.MemberBox (file:/nvme0n1p7/DATAp7/mir/inst/OpenRefine/openrefine-3.5.0/webapp/WEB-INF/lib/js-1.7R2.jar) to method sun.net.www.protocol.file.FileURLConnection.getInputStream()
WARNING: Please consider reporting this to the maintainers of org.mozilla.javascript.MemberBox
WARNING: Use --illegal-access=warn to enable warnings of further illegal reflective access operations
WARNING: All illegal access operations will be denied in a future release
23:56:24.106 [                   refine] POST /command/core/load-language (3717ms)
23:56:24.134 [                   refine] GET /command/core/get-preference (28ms)
23:56:24.150 [                   refine] POST /command/core/load-language (16ms)
23:56:24.156 [                   refine] POST /command/core/load-language (6ms)
23:56:24.163 [                   refine] POST /command/core/load-language (7ms)
23:56:24.492 [                   refine] POST /command/core/get-importing-configuration (329ms)
23:56:24.533 [                   refine] GET /command/core/get-all-project-tags (41ms)
23:56:24.581 [                   refine] GET /command/core/get-all-project-metadata (48ms)
23:56:24.640 [                   refine] GET /command/database/saved-connection (59ms)
23:56:24.681 [                   refine] GET /command/core/get-csrf-token (41ms)
23:56:24.701 [                   refine] GET /command/core/get-languages (20ms)
23:56:24.746 [                   refine] GET /command/core/get-version (45ms)
23:56:56.931 [                   refine] POST /command/core/load-language (32185ms)
23:56:56.940 [                   refine] GET /command/core/get-preference (9ms)
23:56:56.944 [                   refine] POST /command/core/load-language (4ms)
23:56:56.948 [                   refine] POST /command/core/load-language (4ms)
23:56:56.953 [                   refine] POST /command/core/load-language (5ms)
23:56:57.157 [                   refine] POST /command/core/get-importing-configuration (204ms)
23:56:57.164 [                   refine] GET /command/core/get-all-project-tags (7ms)
23:56:57.179 [                   refine] GET /command/core/get-all-project-metadata (15ms)
23:56:57.196 [                   refine] GET /command/core/get-csrf-token (17ms)
23:56:57.208 [                   refine] GET /command/core/get-languages (12ms)
23:56:57.238 [                   refine] GET /command/core/get-version (30ms)
23:56:57.279 [                   refine] GET /command/database/saved-connection (41ms)
...

The error no longer occurs; only some warnings remain.
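The startup log also notes that OpenRefine is configured to use 1400M of memory and can run better with more. The sketch below picks a larger heap size and prints a start command that uses the `-m` option of the `refine` launcher script. The "half of free memory" rule and the function names are assumptions of this example, not a recommendation from the OpenRefine project.

```shell
#!/bin/sh
# Sketch: derive a heap size from the free memory reported at startup.

# Print a heap size in megabytes: half of the free memory, but never
# below the 1400M default. The "half" rule is an assumption of this sketch.
suggest_refine_memory() {
    free_mb="$1"
    half=$((free_mb / 2))
    if [ "$half" -lt 1400 ]; then
        half=1400
    fi
    printf '%s\n' "$half"
}

# Print (rather than run) the start command with the suggested heap size.
refine_command() {
    printf './refine -m %sM\n' "$(suggest_refine_memory "$1")"
}
```

With the 7717M of free memory reported in the log, `refine_command 7717` prints `./refine -m 3858M`.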

How to do something?

No. What? How? Notes
1 How to remove extra spaces?
2 How to change the browser it opens in on Linux?
  How:
  1. start it from a terminal with the openrefine command
  2. close the browser that it opens
  3. enter http://127.0.0.1:3333/ in the browser you want to use
3 How to update on Linux?
  How:
  1. sudo apt remove openrefine
  2. sudo apt autoremove
  3. sudo snap install openrefine
  Notes:
  1. if the previous version was installed via apt, remove it
  2. uninstall the dependencies that are no longer needed
  3. install the openrefine snap
4 How to speed up data processing on Linux?
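The update steps from row 3 can be sketched as a small script that prints the commands for review instead of running them. The function names are hypothetical, and the `dpkg -s` check is an assumption about how to detect an apt installation; the three commands themselves are the ones listed in the table.

```shell
#!/bin/sh
# Sketch: print the update commands from row 3 without executing them.

# Print the commands; pass "yes" when the old version came from apt.
update_plan() {
    if [ "$1" = "yes" ]; then
        echo "sudo apt remove openrefine"
        echo "sudo apt autoremove"
    fi
    echo "sudo snap install openrefine"
}

# Succeed when the openrefine package is currently installed via apt/dpkg.
installed_via_apt() {
    dpkg -s openrefine >/dev/null 2>&1
}
```

A possible use: `if installed_via_apt; then update_plan yes; else update_plan no; fi`, then run the printed commands by hand once they look right.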

Related pages

Links