Skip to content

Latest commit

 

History

History
49 lines (43 loc) · 1.76 KB

README.md

File metadata and controls

49 lines (43 loc) · 1.76 KB

Motivation

In some scenario of stream computation, the raw data are usually collected from a message queue like kafka,which will be persistented for further analyze. For most of the time, the raw data are stored as a simple Java bean in memory and could easily be converted to a HashMap. The scala_sql is designed to run simple SQL query on the data structure List<Map>, allowing the programmer to do some real time computing.

Demo

suppose there is a List called user,which contains these data:

[name:tsc,age:30,sex:male]  
[name:syy,age:29,sex:female]  
[name:dudu,age:1,sex:male]  
[name:xiaohua,age:2,sex:null]  
[name:tsc,age:19,sex:male]  
[name:tsc,age:99,sex:female]  
[name:tsc,age:30,sex:female]  

With this SQL query,

Engine.query(user,"select count(user) as number,name from user group by name")

it will output this:

[number:1,name:dudu]  
[number:4,name:tsc]  
[number:1,name:xiaohua]  
[number:1,name:syy]  

Supported function and clause

max,min,sum,count,count(distinct),avg order by, group by, limit

Supported java/scala data type

String,Integer,Double,Date

Need to improve

1.Currently the Java/Scala data type are implicitly converted to the build-in type using the scala implicit class, however it can't not convert null which is a special type in Java. When doing computation like sum, order by on column which has a null value, the code will throw an exception. In another word, if a column has a null value, you can not do any computation on it.

2.The engine uses the build in function

sortBy[B](f: A => B)(implicit ord: Ordering[B]): Repr = sorted(ord on f)

to do sort the result list. However the engine does not support clause like this:

order by a desc, b asc