relpipe/relpipe-web: comparison relpipe-data/principles.xml

equal deleted inserted replaced

-:c952261978e8
+:5b0fab48d59e
 			The <m:name/> data format should be concise – the data should be represented by reasonably small amount of bytes.
 			The format should support large amounts of small values and also sparse data (structures with many NULL/missing values) without wasting too much space.
 			The data that are not written don't need to be compressed and thus have the best compression ratio.
 		</p>
+		<h2>Streaming</h2>
+		<p>
+			Relational tools should process streams of data and should hold only necessary data in the memory
+			i.e. the tool should produce the output (the first record) as soon as possible while still reading the input (following records).
+			Thus the memory usage does not depend on the volume of processed data.
+		</p>
+		<p>
+			However, there are cases where such streaming is not feasible e.g. if we need to compute some statistics or a column widths while printing a table in the terminal.
+			In such situation, we must read the whole relation and only then generate the output.
+			But we should still be able to do streaming on the relations level e.i. if there are more relation, we always hold only one of them in the memory.
+		</p>
+		<p>
+			This rule is important not only from the performance point of view but also for user experience.
+			The user should see the output as soon as possible i.e. the longer running processes will produce result continuously instead of flushing everything at the end.
+			This is also good for debugging and <em>looking inside the things</em>.
+		</p>
 		<h2>Unambiguity</h2>
 		<p>
 			There should be only one way to represent a single value.
 			For example the booleans can be written as <code>00</code> (false) or <code>01</code> (true) and every other value (<code>02..FF</code>) should be invalid/unsupported.

branch	v_0
changeset 188	5b0fab48d59e
parent 150	7d7d4e1f293f
child 204	58c40f213028