PyDataColl is a SCADA-like system which use Python as the main language. It originally inspired by an old program I made as for a core part of a large EMS.

PyDataColl can be roughly divided into three parts:

  • An APIServer provides RESTful Services for client to pull/push data from/to devices and perform generic CRUD on devices, terms and items.
  • A DeviceManager that manages all devices and terms connected to the system, listens messages send by APIServer that perform CRUD on devices and terms. It may be combined with some plugins to perform generic operation such as data checking, database saving and formula calculation.
  • Many devices and terms under control of DeviceManager operate with coded data over communication channels(TCP/IP) so as to provide control of remote equipment(meter or sensor). Each type of Device can communicate with one type of meter with specify protocol, such as Modbus TCP, IEC 60870-5-104.


Automatic installation:

pip install PyDataColl

PyDataColl is listed in PyPI and can be installed with pip or easy_install. Note that the source distribution includes unittest that are not present when PyDataColl is installed in this way, so you may wish to download a copy of the source tarball as well.

Manual installation: Download source code:

tar xvzf pydatacoll-0.1.tar.gz
cd pydatacoll-0.1
python build
sudo python install

The PyDataColl source code is hosted on GitHub.

Prerequisites: PyDataColl runs on Python 3.5+. In addition to the requirements which will be installed automatically by pip or install, the following optional packages may be useful:

  • Redis is heavily used by PyDataColl as NoSQL databases and IPC. If you deploy PyDataColl in local, make sure you have installed and started the Redis server.
  • MySQL is used by DbSaver plugin to store device data in real-time. If you deploy PyDataColl in local, make sure you have installed and started the MySQL server.
  • ujson is an ultra fast JSON encoder and decoder written in pure C with bindings for Python. This is an alternative json library and PyDataColl will use it automatically if possible.

Quick Start

  1. Simply run the following to start PyDataColl server(add “-h” to see all available parameter):



To stop server, press CTRL+C to exit.

  1. Visit http://localhost:8080 in browser to see the server information, if success, you will find something like this:

    PyDataColl is running, available API:
    method: GET      URL: http://localhost:8080/
    method: GET      URL: http://localhost:8080/api/v1/device_protocols
    method: GET      URL: http://localhost:8080/api/v1/devices
    (...more omitted)
  2. The server is running now. You can send request to server with your favorite http client! check RESTful API to see the API list.

Platforms: PyDataColl should run on any Unix-like platform, although for the best performance and scalability only Linux (with epoll) and BSD (with kqueue) are recommended for production deployment (even though Mac OS X is derived from BSD and supports kqueue, its networking performance is generally poor so it is recommended only for development use). PyDataColl will also run on Windows, although this configuration is not officially supported and is recommended only for development use.


This documentation is also available in PDF formats.


PyDataColl is offered under the Apache 2 license.

Indices and tables