HarirData's mission is to help organizations and individuals:

1. Maximize the value of their existing data assets

2. Develop data strategy and capability

3. Transform big data into smart decisions

Anchored in top research institutes, we aim to expedite knowledge exchange between academia and industry in big data applications and business analytics, and to creatively tackle the data challenges that organizations face. Our high-profile team consists of domain experts and data scientists with extensive research and application experience.


Kathy Ning Shen, PhD

She has over 15 years of experience in quantitative analytics in marketing, information systems, and related areas. Her main research areas include human-computer interaction and user experience, applications of information systems in organizations, e-marketing, virtual communities, and knowledge management. She has published more than 60 refereed journal and conference articles, and her work has appeared in top refereed journals and conferences in these fields.

Xitong Guo, PhD

He leads the big data analytics and eHealth research team at Harbin Institute of Technology. He has published over 40 papers in top journals and edited 3 books. His research on eHealth is supported by the Major Program of the National Natural Science Foundation of China.

Jinfeng Yang, PhD

He specializes in natural language processing and information extraction, with extensive experience in software development (Java, SQL, Python, and others) and in a range of data analysis methods.


Beyond the core team, our extended team includes both industry experts and academic scholars.

Analyzing blog traffic and blogosphere

Prof. Guo and his colleagues use chaos theory to find the underlying order of blogging, including individual blog traffic dynamics and blogosphere structure. Key concepts of chaos theory are used to construct an interpretive framework that illustrates the dynamics of blog system behavior. Blog systems tend to be nonlinear, dynamic, and deterministic, as well as sensitive to initial conditions. The study also demonstrates the feasibility of applying chaos theory thinking to areas such as knowledge management and the recent global financial crisis.

Implications for practice and research opportunities are presented. The related publication is: Xitong Guo, Doug Vogel, Zhongyun Zhou, Xi Zhang, Huaping Chen. “Chaos Theory as a Lens for Interpreting Blogging,” Journal of Management Information Systems (JMIS), 2009, 26(1):101-128.

Text mining for electronic medical records

Electronic medical records (EMRs) are generated during clinical treatment and document patients' health conditions and related medical knowledge. Dr. Yang and his colleagues developed methods to analyze EMR content, which can in turn support clinical decisions, the practice of evidence-based medicine, and other medical applications. Similar text mining methods can be applied to other areas, such as online consumer reviews and performance appraisals. The related publications are:

YANG Jin-Feng, YU Qiu-Bin, GUAN Yi, JIANG Zhi-Peng. An Overview of Research on Electronic Medical Record Oriented Named Entity Recognition and Entity Relation Extraction. Acta Automatica Sinica, 2014, 40(8): 1537–1562 http://www.aas.net.cn/CN/abstract/abstract18425.shtml

YANG Jin-Feng, GUAN Yi, HE Bin, QU Chun-Yan, YU Qiu-Bin, LIU Ya-Xin, ZHAO Yong-Jie. Annotation Scheme and Corpus Construction for Named Entities and Entity Relations on Chinese Electronic Medical Records. Journal of Software, 2016 http://www.jos.org.cn/ch/reader/view_abstract.aspx?file_no=4880

Data Model for

Providing the right data at the right time in dynamic circumstances is critical for business process integration and competitive advantage. Focusing on data flow, this work develops workflow management and mathematical techniques to address data exchange problems in independent multi-stakeholder business process integration in dynamic circumstances.

We build artifacts that include inter-organizational workflow concepts, a workflow model, and a public dataset calculation method. The use of the proposed artifacts is illustrated by applying them to a real-world case in the Shenzhen (Chaiwan) port. The utility of the artifacts is evaluated through interviews with industry practitioners.

We conclude that this research complements the control-flow perspective in the interorganizational workflow management area and also contributes to B2B information-sharing literature; further, the dataflow formalism can help practitioners to formally provide the right data at the right time in dynamic circumstances.

Xitong Guo*, Sherry Sun and Doug Vogel. “A Data Flow Perspective for Business Process Integration,” ACM Transactions on Management Information Systems (ACM TMIS), 5(4), pp. 1-33, 2014. http://dl.acm.org/citation.cfm?id=2629450


Contact Information

Email: info@harirdata.com
Phone: +971 50 7819 852
Address: PO Box 390667, Dubai, United Arab Emirates