Hi, First, I would like to express my appreciation for your outstanding works in multimodal sentiment analysis, which have greatly inspired me.
But I have one problem about "Party GRU" of your DialogRNN code:

As shown in the figure above, my understanding is Speaker A's party state at time t-1 as input to the Party GRU. right?