SAS how to use first. with NOTSORTED

Richard 2020-02-05 01:14

A by-group is the sequence of rows that are adjacent, having the same by-var values.

NOTSORTED is for processing groups constructed from by values that are contiguous yet not sorted.

All your sample data has by-groups of size 1 because none of the id values are repeated looking down the column.

Here are two techniques you can try:

sort the data by referenceid and <some-other-sequencing-variable> and do normal by group processing.
maintain a hash of referenceid and hit-counts as you process the data set

Hash example (my sequenceId === your join_key):

data want;
  set have;
  if _n_ = 1 then do;
    declare hash ids();
    ids.defineKey('referenceid');
    ids.defineData('referenceid', 'sequenceId');
    ids.defineDone();
  end;

  if ids.find() ne 0 
    then sequenceId = 1;
    else sequenceId + 1;

  ids.replace();
run;

Related issues

How to convert date format from imported CSV to be able to merge data

Unable to apply a SAS macro to a column

SAS Column adding a backslash

Loop for checking multiple conditions in SAS with or without MACROS

Summarise and calculate the items specifically in the dataset using proc sql

The SAS way to loop over a table outside a data step

if statement conditions are embedded in a column

Combine two strings for file path in SAS

NodeJS reading and producing SAS files: xport and sas7bdat

create data step variables using a dynamic macro-variable