It is nice that there is a sample node option for In-DB; however, it doesn't take a random sample. It isn't always feasible for me to stream the data out and use the random sample % option. In fact, when I use the Data Stream Out option In-DB, my workflows often crash because it can't handle the number of records I am trying to stream out.
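
To illustrate what I mean by a random sample done inside the database, here is a minimal sketch in Python. It uses the built-in `sqlite3` module only as a stand-in for a real database; the `records` table, its columns, and the `random_sample_in_db` helper are all hypothetical and are not part of any In-DB tool. The point is that the database picks the random rows, so only the sample leaves it instead of the full record set.

```python
import sqlite3


def random_sample_in_db(conn, table, n):
    """Return n randomly chosen rows, with the sampling done by the database."""
    # Table names cannot be bound as query parameters, so validate the name
    # before interpolating it into the SQL text.
    if not table.isidentifier():
        raise ValueError(f"unsafe table name: {table!r}")
    # SQLite's RANDOM() assigns a random sort key to each row; LIMIT keeps n.
    query = f"SELECT * FROM {table} ORDER BY RANDOM() LIMIT ?"
    return conn.execute(query, (n,)).fetchall()


# Hypothetical demo data: an in-memory table with 10,000 rows.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE records (id INTEGER PRIMARY KEY, value REAL)")
conn.executemany(
    "INSERT INTO records (id, value) VALUES (?, ?)",
    ((i, i * 0.5) for i in range(10_000)),
)

# Only the 100 sampled rows are returned to the client.
sample = random_sample_in_db(conn, "records", 100)
print(len(sample))  # prints 100
```

`ORDER BY RANDOM()` still makes the database sort the whole table, so on very large tables it may be slow. Some databases (PostgreSQL and SQL Server, for example) offer a `TABLESAMPLE` clause that may be cheaper, though the syntax and the sampling guarantees differ by vendor.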