SeimiAgent

01 May 2016

A headless,standalone webkit server which make grabing dynamic web page easier.

中文文档

Quick Start

cd /dir/of/seimiAgent
./seimiagent -p 8000

SeimiAgent will start and listen on the port that you set.Than you can use any http client tools post a load reqest to SeimiAgent and get back the content which just like chrome do.Http client tools you can use: apache httpclient of java,curl of cmd,httplib2 of python including, but not limited to.

Demonstrates

demo

Http parameters that seimiAgent support

Only support post.Request path:/doload

How to build

It will take a very long time to build,so it is recommended to use the premade binary file in ‘Download’.

Requirements

sudo apt-get install build-essential g++ flex bison gperf ruby perl libsqlite3-dev libfontconfig1-dev libicu-dev libfreetype6 libssl-dev libpng-dev libjpeg-dev python libx11-dev libxext-dev
yum -y install gcc gcc-c++ make flex bison gperf ruby openssl-devel freetype-devel fontconfig-devel libicu-devel sqlite-devel libpng-devel libjpeg-devel

Build

python build.py

Then wait or take a cup of tea.

More

Seimi Family – SeimiCrawler,JsoupXpath