<p class="wp-block-paragraph">You don’t expect the tendrils of grief<br>To reach out and strangle your heart</p>
<p class="wp-block-paragraph">You don’t expect the moments of life<br>To call back the regrets, the pain, or the loss</p>
<p class="wp-block-paragraph">You don’t expect the places, spaces or times<br>To remind you of what you lost</p>
<p class="wp-block-paragraph">And yet – without expectation<br>Without planning<br>They rise up<br>And exhale<br>Unfurling</p>
<p class="wp-block-paragraph">It was as we drove in and he said<br>“I remember this place.”<br>“It was the last time.”<br>It was. The last time we saw her.</p>
<p class="wp-block-paragraph">But the pain isn’t even that.</p>
<p class="wp-block-paragraph">It’s years of loss.<br>Years of struggle<br>Years of heartache and hard work<br>Years of stress and stressors</p>
<p class="wp-block-paragraph">And these moments call it all back.</p>
<p class="wp-block-paragraph">In ways I didn’t expect.</p>
<p class="wp-block-paragraph">So today I sit.</p>
You don’t expect the tendrils of grief To reach out and strangle your heart
You don’t expect the moments of life To call back the regrets, the pain, or the loss
You don’t expect the places, spaces or times To remind you of what you lost
And yet – without expectation Without planning They rise up And exhale Unfurling
It was as we drove in and he said “I remember this place.” “It was the last time.” It was. The last time we saw her.
But the pain isn’t even that.
It’s years of loss. Years of struggle Years of heartache and hard work Years of stress and stressors
<p class="wp-block-paragraph"><strong>Looking at the Numbers · WHS Elders Consultation Survey</strong> (<a href="https://andrewyager.com/wp-content/uploads/2026/06/WHS_Survey_Reading_Final.pdf">Download the PDF version</a> | <a href="https://andrewyager.com/wp-content/uploads/2026/06/WHS_Survey_Summary.pdf">Download the PDF Summary</a> | <a href="https://andrewyager.com/wp-content/uploads/2026/06/Survey-Data-from-the-WHS-consultation.pdf">Download the Committee Report and Survey Results</a>)</p>
<p class="wp-block-paragraph"><em>Earlier, I wrote about the case for building the door — before we change the lock — and this rested on what we could learn from other churches and secular wisdom. The data from the consultation survey lets us stop guessing and look: it confirms the door has to exist. This is an argument about impact, not the theology of eldership. Make the change if it must be made — but Scripture, and our own tradition, ask us to build what it requires first.</em></p>
<p class="wp-block-paragraph">Let me say at the outset what this is not. It is not a case for or against male-only eldership. Someone can be a convinced complementarian, sure the change is biblically right, and still owe an honest account of what it will actually <em>do</em>. Deciding whether to act and seeing clearly what acting will cost are different things, and the second doesn’t go away once the first is settled. The survey was never a referendum on the theology, and neither is this. It asks one question: if the change is made, what does the denomination’s own data say will happen to its churches — and is it ready for that?</p>
<p class="wp-block-paragraph">The committee’s report opens with a reassurance: 91% feel safe always or most of the time, steady whether or not a congregation has women elders. That is good news, and we should want it to be true. It is also worth being clear about which question it answers. Nobody proposed this change expecting people to feel unsafe the next morning; how safe people feel on an ordinary Sunday was never really what was in dispute. The worry raised about the change was structural — what avenues exist, who gets heard, whether someone would speak up when something goes wrong. That kind of risk doesn’t show up in a question about present-day safety. It shows up in what people say they would <em>do</em>: whether they would report, whether they feel supported, what the churches that have actually lived this arrangement say about it. The safety headline is true and reassuring. The questions closer to the actual concern sit a row or two down in the same tables, and they read less reassuringly — which is why the rest of this looks there.</p>
<h2 class="wp-block-heading">First, how big is this response?</h2>
<p class="wp-block-paragraph">Before reading the signal, it’s worth knowing whether 1,787 voices is a lot or a little — because both cheap dismissals are wrong, and naming the real base disarms them. NSW has roughly 153,000 people who tick “Presbyterian and Reformed” on the census. But census identity is a cultural ghost, not a churchgoer. The PCNSW is a community of about 35,000; of those, around 14,000 adults are in average attendance across 183 congregations. <em>Those</em> are the people a survey about church life can actually reach.</p>
<figure class="wp-block-table"><table><thead><tr><th>What 1,787 is measured against</th><th>Figure</th><th>1,787 is…</th></tr></thead><tbody><tr><td>NSW census “Presbyterian and Reformed” (2021)</td><td>~153,000</td><td>~1% — the affiliation ghost, not the church</td></tr><tr><td>PCNSW “community” (self-described)</td><td>~35,000</td><td>~5%</td></tr><tr><td>Average adult attenders</td><td>~14,000</td><td>~13% — roughly one in eight</td></tr><tr><td>Congregations represented</td><td>117 of 184</td><td>~64% of all charges</td></tr></tbody></table></figure>
<p class="wp-block-paragraph"><em>Census figure from ABS 2021; attendance and congregation counts are the denomination’s own current figures. Measured against the people who actually attend, this is a substantial, denomination-wide response — not a fringe, and not “barely 1%.”</em></p>
<p class="wp-block-paragraph">So set both dismissals aside. This is not a vocal minority — one in eight regular attenders, across nearly two-thirds of congregations, is a serious response by any church-survey standard. Nor is it a census; at 13% of attenders it is exactly the size where <em>who answered</em> decides what the numbers mean. That is the hinge the rest of this turns on.</p>
<p class="wp-block-paragraph">One caveat sits over everything below. Every figure here — the survey responses, and the denomination’s own counts of elders, attenders and vacancies — comes from partial returns. The survey heard from some attenders, not all. The denominational counts come from the congregations that reported, not every charge. And like every voluntary consultation, this one is subject to self-selection: those with strong views, either way, are likelier to answer. That doesn’t make the sample random. But two things make a small mobilised faction an unconvincing explanation for what it found. The first is sheer volume: around one in eight adult attenders responded, a high rate by any church-survey standard, and a fringe is rarely that large. The second is internal consistency — the same ordering across differently-worded questions, which a simple over-representation of the agitated on one side would not produce. Neither makes the result a population proportion, and I don’t read it as one; together they make it hard to dismiss as noise. So I’m reading what the respondents said, not pronouncing on settled populations — and the reading doesn’t need a precision the returns can’t give.</p>
<p class="wp-block-paragraph">The survey’s own demographic chart shows roughly how the 1,787 split by role. Reading it off the page gives approximately:</p>
<figure class="wp-block-table"><table><thead><tr><th>Who answered (approximate)</th><th>Share</th><th>~Respondents</th></tr></thead><tbody><tr><td>Members, regular attenders, volunteers</td><td>~88%</td><td>~1,570</td></tr><tr><td>Elders</td><td>~6%</td><td>~110</td></tr><tr><td>Ministers / pastoral workers / CGW</td><td>~3%</td><td>~50</td></tr><tr><td>Paid staff (incl. pastoral assistants)</td><td>~1.5%</td><td>~27</td></tr><tr><td>Deacons</td><td>~1.5%</td><td>~27</td></tr></tbody></table></figure>
<p class="wp-block-paragraph"><em>Read off the survey’s demographic graph, so approximate. The lay cohort is the bulk of the response; the office-bearer columns are smaller, and two of them are small enough to need care.</em></p>
<h2 class="wp-block-heading">How to read these numbers before we start</h2>
<p class="wp-block-paragraph">A percentage from a small group is not a precise point; it is a band, and the width of the band comes straight from how many people answered. When only about 27 people are in a column, each one is a twenty-seventh of the total — one person in twenty-seven is nearly four points — so if just a handful had answered differently, the figure would swing several points at once. That movement is what makes the band wide: around nineteen points either way. Put more people in the column and each one counts for less. One in 110 elders is under a point; one in the 1,500-plus lay respondents is barely a twentieth of a point. So those figures hardly move, and the band tightens — to roughly nine points for elders, and just two or three across the lay column. Same survey, same logic: the more people answered, the less any one of them can shift the number, and the steadier it is.</p>
<p class="wp-block-paragraph">Statisticians call that band a 95% confidence range. Read the survey this way and the true figure should fall inside the band about nineteen times in twenty. So “47% of paid staff expect leadership to deteriorate” really means “somewhere in the region of 28% to 66%” — a wide band, and an honest one. That isn’t a flaw in the survey; it is just what small samples do, and once you read the figure as a band rather than a number, it behaves honestly.</p>
<p class="wp-block-paragraph">Here is the part that matters. The useful question is not “what is the exact figure?” but “which side of the line does the whole band sit on?” Take that same paid-staff figure. Even at the very bottom of its band — 28% — it still outweighs the 17% who expected leadership to improve. The band is wide, but it sits on the same side of the line from end to end. That is what a small sample can honestly tell you: not the precise number, but the direction, when the direction holds right across the margin.</p>
<p class="wp-block-paragraph">So read the small columns — paid staff and deacons, about 27 each — for which way they lean, not for their decimals. The precision lives in the big columns: over 1,500 lay respondents and around 110 elders, where the figures can be taken close to face value. Nothing below asks a 27-person column to carry a number on its own. It asks only whether each column leans the same way as the large ones — and for that, the small samples are entirely good enough.</p>
<p class="wp-block-paragraph">One more thing worth reading off the survey’s charts before the questions: who answered, by gender. The response was not heavily distorted by either gender — overall it ran about 56% female to 42% male, and the columns that carry most of the concern are mixed, the lay respondents around 60/40 and the paid staff about 52/48. Both men and women cared enough about this to answer, which means the sample is a reasonably representative cross-section of the church rather than a single-gender bloc. That matters for how you read everything below: a negative result here is not simply one group, unhappy at losing something, speaking for itself. One limit to be honest about — there is no published breakdown of answers by gender, so we can’t see whether, on a given question, one gender in a particular role was markedly more concerned than another. That cut would be worth having. But even without it, the balance of who responded tells us this is not the voice of one gender alone.</p>
<p class="wp-block-paragraph">The committee’s own summary of the results notes that the responses were not drastically different by gender, age or role, and that the sharpest variation was whether a congregation currently has female elders. That is fair, and worth holding alongside what follows. The point below is not that the roles split wildly from one another — on most questions they don’t. It is narrower: one cohort, the ministers, sits apart from all the others, who cluster together. Reading for that single break is not in tension with the committee’s summary; it is a closer look at the one place the otherwise broad agreement parts.</p>
<p class="wp-block-paragraph">For orientation on the offices themselves: the denomination’s 2026 snapshot records around <strong>700 elders</strong> (about 12% female), <strong>178 ordained ministers</strong> and ministry workers plus 26 commissioned gospel workers, and <strong>8 deaconesses</strong>. The office of <em>deacon</em> isn’t given a separate figure in the published data, which is why I read the deacons’-court column for direction rather than against any office total. So the ~50 ministers who answered are about a quarter of their office; the ~110 elders a useful slice of theirs; and the lay column simply enormous. Hold those rough sizes in mind — they shape what each survey column can and can’t tell us.</p>
<h2 class="wp-block-heading">The pattern the pie charts wash out</h2>
<p class="wp-block-paragraph">Take Q8 — whether excluding women from eldership would affect the quality of leadership. Read by role, one cohort stands apart from all the others. Ministers expect deterioration at 25%; everyone else — paid staff, elders, deacons, members — sits between 47% and 57%. The elders are not the unconcerned ones here: a clear majority of them, 54%, expect the quality of leadership to fall. So do the deacons in their pastoral role, and the members in the pews. It is the ministers, and the ministers alone, who break away from the rest:</p>
<figure class="wp-block-table"><table><thead><tr><th>Q8 — “Will affect leadership quality”</th><th>Minister/<br>PW/CGW</th><th>Paid Staff</th><th>Elder</th><th>Deacon</th><th>Member/<br>Adherent</th></tr></thead><tbody><tr><td>Yes, positively (will improve)</td><td>27%</td><td>17%</td><td>14%</td><td>3%</td><td>12%</td></tr><tr><td>Yes, negatively (will deteriorate)</td><td>25%</td><td>47%</td><td>54%</td><td>57%</td><td>56%</td></tr><tr><td>No change</td><td>45%</td><td>28%</td><td>29%</td><td>29%</td><td>23%</td></tr><tr><td>Unsure</td><td>2%</td><td>8%</td><td>4%</td><td>4%</td><td>10%</td></tr></tbody></table></figure>
<p class="wp-block-paragraph"><em>Ministers are the outlier on the low end; every other role clusters high. The gradient — not the average — is the finding.</em></p>
<p class="wp-block-paragraph">This is the single most important shape in the data, and the aggregate pie chart erases it. The deacons’-court column sits at the top, and the four non-minister columns move together — a single ordering that the “All Respondents” pie cannot show.</p>
<p class="wp-block-paragraph">And it isn’t one question. The same ordering recurs across the survey, which is what turns a curiosity into a finding:</p>
<figure class="wp-block-table"><table><thead><tr><th>“Negative / increase / less likely” by role</th><th>Minister/<br>PW/CGW</th><th>Paid Staff</th><th>Elder</th><th>Deacon</th><th>Member/<br>Adherent</th></tr></thead><tbody><tr><td>Q9 — significant <em>negative</em> culture impact</td><td>22%</td><td>28%</td><td>37%</td><td>53%</td><td>43%</td></tr><tr><td>Q10 — <em>yes</em>, impacts relationships/teams</td><td>49%</td><td>61%</td><td>61%</td><td>74%</td><td>68%</td></tr><tr><td>Q11 — significant <em>negative</em> wellbeing impact</td><td>11%</td><td>17%</td><td>27%</td><td>33%</td><td>24%</td></tr><tr><td>Q13 — <em>increase</em> in conflict/bullying/harassment</td><td>24%</td><td>24%</td><td>40%</td><td>47%</td><td>31%</td></tr><tr><td>Q14 — staff/volunteers <em>less likely</em> to report</td><td>20%</td><td>31%</td><td>33%</td><td>53%</td><td>41%</td></tr></tbody></table></figure>
<p class="wp-block-paragraph"><em>Five differently-worded questions. The same role ordering each time: ministers lowest, deacons highest, everyone else stacked in between.</em></p>
<p class="is-style-info wp-block-paragraph"><strong>Why the pattern is the finding, not the noise</strong><br>The same cohort ordering turns up across five differently-worded questions. A one-off result might be a quirk of a small sample; the same ordering recurring five times over is much harder to put down to isolated variation. When deacons, elders and members all point in the same direction on leadership, culture, relationships, wellbeing, conflict and reporting, while ministers sit lowest each time, that consistency is the evidence on its own — whatever the precision of any single percentage. One figure might sit anywhere within a margin. A pattern that holds across five differently-worded questions is not so easily explained away.</p>
<h2 class="wp-block-heading">Reading honestly in both directions</h2>
<p class="wp-block-paragraph">First, a discipline worth holding to, since the argument only earns its weight by observing it: not every question is a warning. Some results are genuinely divided, and a divided result isn’t a negative one. Where about as many expect improvement as expect harm, the honest word is “split,” and it would be wrong to count the negative half while setting the positive half aside. So here is each question sorted by what it actually shows. Naming the split ones as split is what makes it fair to name the lopsided ones as lopsided.</p>
<figure class="wp-block-table"><table><thead><tr><th>Question</th><th>Positive</th><th>Negative</th><th>Reads as</th></tr></thead><tbody><tr><td>Q8 — leadership quality (non-minister roles)</td><td>3–17%</td><td>47–57%</td><td>Lopsided — negative ~3× positive</td></tr><tr><td>Q9 — church culture (non-minister roles)</td><td>14–22%</td><td>56–70%</td><td>Skewed negative</td></tr><tr><td>Q11 — personal wellbeing (ministers)</td><td>25%</td><td>20%</td><td>Genuinely split — near wash</td></tr><tr><td>Q13 — conflict / bullying / harassment</td><td>3–11%</td><td>24–47%</td><td>Lopsided toward increase</td></tr><tr><td>Q14 — willingness to report</td><td>3–9%</td><td>20–53%</td><td>Strongly asymmetric</td></tr></tbody></table></figure>
<p class="wp-block-paragraph"><em>Granting the washes doesn’t weaken the case; it clarifies it. Ministers’ own wellbeing answer (Q11) is close to even — said plainly, because it is true. The reporting and conflict questions are not even close.</em></p>
<p class="wp-block-paragraph">That is the thing to hold onto. The argument doesn’t need every number to lean one way, and several don’t. It rests on the questions that bear on structure and safety — reporting, conflict, the loss of a safeguarding voice — and those are the ones that aren’t washes. A balanced question elsewhere doesn’t settle those, because the balanced questions are not the ones carrying the weight.</p>
<h2 class="wp-block-heading">What each role’s numbers actually carry</h2>
<p class="wp-block-paragraph">Start with the ministers, because they sit at the low end of the gradient and it would be easy — and wrong — to wave them away. The survey’s ministry-workers column — about 178 ordained ministers and ministry workers in the denomination — drew around fifty responses, about a quarter of that office. That is a fair slice, not a thin or self-selected sliver. So the low minister concern is real, decently sampled, and sincerely held, and I am not going to explain it away. What makes the ministers worth pausing on isn’t that they are unrepresentative. It is that they are reading this change very differently from everyone else who answered — and they sit, with the elders, among those who will vote on it. That is the thread the next section picks up.</p>
<p class="wp-block-paragraph">The elders sit on firmer ground than the small office columns, and for a simple reason: more of them answered. Around 110 elders responded — far more than the ministers, deacons or paid staff — and that larger sample tightens the band from the ±19 of a 27-person column to something nearer ±9. So where the small columns can be trusted only for direction, the elder figure can be read closer to its face value. Of the elders who answered, around 54% expect leadership quality to deteriorate. No special pleading is needed for that one. It is the most ordinary number in the whole exercise — a clear majority of a decent sample, read at something like its actual value rather than as a wide band — and it leans the same way as the rest.</p>
<p class="wp-block-paragraph">Deacons need care for one straightforward reason: the size of the sample. About 27 people identifying with the deacons’ court answered — a small base, read as a band, not a figure. That is enough to read which way the column leans, not enough to trust its decimal. And of those who answered, the lean is the same as everyone else’s: around 57% expect things to get worse. Don’t take the 57% to the decimal, and don’t bin the column either. Read it for direction, and the direction puts it with the rest — in fact at the head of it.</p>
<h2 class="wp-block-heading">Are we all answering the same question?</h2>
<p class="wp-block-paragraph">Here the data turns from broadly agreed to genuinely striking — though not in the way you might expect. The ministers who answered are the outlier, lowest concern on every question, while elders, deacons, members and paid staff all read it as carrying a real cost. That gap is remarkable precisely because it is so disjointed: the people who lead these churches and the people who fill and serve them are looking at the same proposed change and reporting very different things about what it will do.</p>
<p class="wp-block-paragraph">It is worth asking why, and one possible explanation is worth offering — tentatively, because it concerns how people read a question rather than anything the survey directly measured. Some who support the move to male-only eldership see it as a matter of critical biblical obedience: not a question of cost and benefit at all, but of faithfulness to Scripture. Read that way, the impact question may land differently — a thing one believes is commanded is a thing the mind is less inclined to weigh a price against. Offered as a possibility, not a diagnosis, and not of any individual, it would explain the disjoint without anyone being foolish or self-serving: one group may be answering a question about obedience while the others answer a question about cost.</p>
<p class="wp-block-paragraph">And obedience is good; you do not refuse to obey because obedience is expensive. If those who support the overture are right that this is commanded, proceeding despite the cost may be exactly faithfulness, and I am not claiming to have settled that. But counting the cost and obeying anyway is not the same as obeying and seeing no cost — and we may not all agree which of those faithfulness requires. The conviction that something is right and the observation that it will be costly are not rivals; a church can hold both, and most of the survey’s respondents do.</p>
<p class="wp-block-paragraph">This matters for the decision, though not in the way it would if ministers simply outvoted everyone. They don’t. At Assembly the floor is, in principle, shared: each charge is represented by a minister and an elder, two voices of equal standing in the vote. So the room that decides is not the low-concern reading writ large. It is split down the middle — and the elder half of it is among the most concerned in the whole survey, with a clear majority expecting the quality of leadership to fall. The disjoint the data found across the church runs straight through the body that will vote on it.</p>
<p class="wp-block-paragraph">That is the thing worth sitting with. The elders who see a cost are not petitioners outside the room; they are full voters inside it, with every right to carry what they have seen into the decision. And the ministers, who are half that voice, have reason to listen hard to the other half — because the people who would absorb this change are telling them, in the church’s own consultation, that they expect it to be felt. The survey gave everyone an equal say; the floor gives ministers and elders an equal vote. On the evidence here, both halves of that floor have cause to take the concern seriously, and the elders in particular have the standing to act on it.</p>
<h2 class="wp-block-heading">Paid staff: the people no one disputes are covered</h2>
<p class="wp-block-paragraph">It is worth pulling paid workers out and looking at them alone, and not out of sentiment. The Work Health and Safety duty reaches further than paid staff: in law a “worker” includes volunteers, so a great many more people in the life of a church are covered than the payroll alone. Some will argue the church shouldn’t have to count volunteers — but that is an argument with the legislation, not with this piece, and I don’t need to win it. Set the whole question of volunteers aside. Even a reader who thinks the survey cast too broad a net by asking the entire church still has to reckon with the paid staff, because about them there is no dispute at all: they are employees, and there are a lot of them — 311 paid through central payroll in 2026, up from 292 the year before and 262 the year before that. Whatever your theology of eldership, they sit inside the WHS framework this consultation was convened to address. An elder’s discomfort is a governance matter. A paid worker’s psychosocial hazard is a duty owed by a person conducting a business or undertaking. The survey gives them their own column, and it repays reading.</p>
<figure class="wp-block-table"><table><thead><tr><th>Paid Staff — selected responses</th><th>Figure</th><th>Reads as</th></tr></thead><tbody><tr><td>Q8 — change will deteriorate leadership</td><td>47%</td><td>Near half</td></tr><tr><td>Q9 — negative culture impact</td><td>56%</td><td>28% “some” + 28% “significant”</td></tr><tr><td>Q10 — yes, impacts relationships/teams</td><td>61%</td><td>Clear majority</td></tr><tr><td>Q11 — some/significant negative wellbeing impact</td><td>37%</td><td>20% + 17%</td></tr><tr><td>Q14 — less likely to report issues</td><td>31%</td><td>Nearly a third</td></tr></tbody></table></figure>
<p class="wp-block-paragraph"><em>Paid staff also reported the highest “always feel safe” figure (72%) — safety and support are not the same thing, a distinction many respondents drew explicitly.</em></p>
<p class="wp-block-paragraph">And paid staff are not the cohort with the governance vote, nor mostly the female-elder congregations the report flags as over-represented. They are the people who turn up to work in these churches. Their column is small — about 27 answered — so read its figures as bands, not points, and let the legal principle carry the weight rather than any single percentage. The principle doesn’t need a large sample: paid staff are employees, and the duty is owed whether 27 of them answered or 270. When a column like theirs leans, as it does here, toward expecting people to be less likely to report conflict, bullying or harassment, that lean matters. The national model code of practice on managing psychosocial hazards — the more recent of the two, even if NSW hasn’t yet adopted its wording — recognises that this kind of conduct is routinely under-reported, out of fear of blame, escalation or reprisal, and that an absence of reports doesn’t mean nothing is wrong. It treats under-reporting as a marker of poor culture and a barrier to seeing hazards at all. The survey contained responses consistent with exactly that concern, from people the framework plainly covers — and they sit below the headline rather than beside it.</p>
<p class="is-style-info wp-block-paragraph"><strong>The finding that deserves the headline</strong><br>A consultation conducted under the banner of work health and safety — convened because of a safety concern — produced data in which, of those who answered, a plurality of several cohorts (around 53% of deacon respondents, 41% of members, 33% of elders, 31% of paid staff) expect people to become <em>less willing to report</em> conflict, bullying or harassment if the change is made, and that all-male sessions would be worse at seeing and handling it. In a church on the other side of a Royal Commission that examined exactly how unchallengeable authority and barriers to disclosure let harm continue, that is not a detail. Every safe-ministry structure the denomination has built assumes people will report. The summary leads with “91% feel safe,” which is true; but the reporting result is the one a safety process most needs to see, and it is worth bringing up to the light beside it.</p>
<h2 class="wp-block-heading">Where the summary and the sharper reading part</h2>
<p class="wp-block-paragraph">The committee’s summary is accurate, and nothing in it is false. But a summary makes choices about what to foreground, and these choices lean a consistent direction — toward the reassuring reading, with the sharper edges set a little further back. Five places show it.</p>
<ul class="wp-block-list">
<li><strong>The hardest comments are summarised, not shown.</strong> The trauma and abuse responses at Q9 are withheld “to ensure such responses are not identified” — a fair privacy rationale, but the effect is that the most serious material is the least auditable. The same is true of the “don’t trust current male leadership” subset at Q8 and the female elders who said they would feel ruled “unbiblical/evil.” We are told these exist; we cannot weigh them.</li>
<li><strong>Safe is quietly substituted for supported.</strong> The committee notes “in passing” that a large number of respondents marked themselves safe but <em>not</em> supported. That distinction is buried under the 91% headline, when it is arguably the more important of the two for a WHS frame.</li>
<li><strong>The sample skew is acknowledged, and it is worth testing rather than waving past.</strong> The report notes that female-elder congregations are over-represented (20% of churches, 31% of responses) and that the strongest predictor of perceived risk is whether a church currently has women elders. That is a fair caution against reading the raw negatives at face value, and it deserves a real answer rather than a rhetorical one. So here is the test — illustrative rather than a formal weighted analysis, but enough to show which way the correction cuts. The over-representation is an excess of about 11 percentage points (31% against an expected 20%). Take the least favourable assumption: that the entire 11-point excess would vanish from a properly weighted sample, and that it fell <em>uniformly</em> on the negative side — the worst case for reading the data as concerning. Strip that weight out of the pooled figures and the leans still hold: a 54% or 57% drops toward the mid-to-high 40s — still a plurality, still elevated, still pointing the same way. The finding was never a bare majority clinging to the 50% line; it was a wide and consistent lean, and an 11-point adjustment on the most-exposed slice doesn’t erase it. And the correction touches only the <em>pooled</em> aggregate. The role gradient is computed within each office column — it is a property of which office a respondent holds, not which church-type they attend — so the over-representation doesn’t enter it at all. The skew affects the “All Respondents” pie charts; it leaves the gradient, where the real signal lives, untouched. Run the test the caution invites, and the reading holds.</li>
<li><strong>The same skew cuts the other way, too.</strong> The female-elder churches answered at about one and a half times their share, and it is worth asking what that over-representation contains. These are the only respondents speaking from <em>current</em> experience of the arrangement the change would remove. Others write from past experience, or from watching other churches; these congregations have women in session today, and would feel the loss directly. So their responses may reasonably be read as particularly informative — not because they should carry extra statistical weight, but because they reflect lived experience of the very arrangement under discussion. The correction above already shows the leans survive even if their excess is stripped out entirely. What this adds is only that, set alongside the correction, their testimony is the survey’s closest thing to first-hand evidence about what removal would mean — and that is worth noticing, not discounting.</li>
<li><strong>The aggregate is foregrounded; the role split is “texture.”</strong> Every flagship chart is “All Respondents.” The role tables are present but never assembled into the cross-question comparison that reveals the gradient. The structure of the document makes the finding hard to see.</li>
</ul>
<h2 class="wp-block-heading">What the change closes has a size, and it’s 12% today</h2>
<p class="wp-block-paragraph">Until now the place women hold in the courts of the church — the standing by which they presently sit in session as of right, rather than by an elder’s invitation — has been an abstraction for many of us in this debate. Not because the figure was unknowable; the numbers were there for anyone who went looking. It is that, like much else here, few had gone looking, and fewer still had put it in front of the people arguing the question. The official figures let us put a size on it. The denomination’s 2026 snapshot records about 12% of elders as female; against an estimated 700 elders that is on the order of <strong>eighty women</strong> — roughly one in eight — though the firmly-counted figure, from the congregations that actually returned their forms, is a little lower, around sixty-six. Call it somewhere between seventy and eighty-five, denomination-wide. Whatever the exact number within that band, it is small, and the proposed change grandfathers those women while closing the door to any more.</p>
<p class="wp-block-paragraph">And the narrowing is not only a future event waiting on the vote. It has, in places, already begun. Female-elder churches are a minority to start with — about one congregation in five — and even among those, some will not appoint new women in anticipation of the change, while in some churches women who would serve as elders already cannot. None of that is necessarily wrong: if the theological frame is male-only eldership, this is simply that conviction being lived out ahead of the formal decision, and the people doing it are acting in good conscience. The point is only that we cannot discuss this standing as though it were intact and waiting. It is contracting now, by retirement and by choice, and every female elder lost — to relocation, to age, to a session that won’t replace her — is, under the proposed rule, unreplaceable. The number is small, and it has already started to fall.</p>
<p class="wp-block-paragraph">That reframes what the female-elder churches are telling us. When those congregations say they would lose something real, we can now put a rough number on the something: somewhere around eighty women, about one in eight elders, with no way to replace them under the proposed rule and only one direction to travel. These are also the only respondents speaking from <em>current</em> experience of what is at stake — women in session now, not a memory of them — which is about as close to evidence as a survey of anticipated effects can get.</p>
<h2 class="wp-block-heading">“Not enough men” was not hypothetical</h2>
<p class="wp-block-paragraph">Several respondents — ministers and members both — raised that in some congregations, particularly regional ones, the people suitable for eldership are sometimes women, and removing them would leave those churches short of leadership altogether. That concern is not idle. The denomination already carries real leadership-supply stress: of 183 pastoral charges, around 19% are genuinely vacant once session and CGW appointments are counted, and three-quarters of the un-appointed vacancies are regional. Whether those are the same churches the eldership change would touch is not something the figures tell us, and this piece does not claim it. But a denomination already short of available leaders, regionally, has less margin to absorb the loss of any eligible candidate — which is enough to make the “not enough men” concern a live structural risk rather than a hypothetical.</p>
<h2 class="wp-block-heading">The church reading, kept honestly</h2>
<p class="wp-block-paragraph">None of the above settles the theological question, and it isn’t trying to. A great many respondents who expect a negative impact say so while affirming that the change may still be the right and biblical one — the elder who writes that obedience to Scripture and leadership quality are simply different axes is making a coherent point, not dodging. The survey was explicitly not a referendum on whether the change is correct, and this piece isn’t one either. A complementarian can hold that male-only eldership is commanded <em>and</em> that the data here describes real costs that fall hardest on women, on paid staff, and on the churches that will have to absorb the change. Those two things are not in tension; the survey is full of people holding both at once.</p>
<p class="wp-block-paragraph">What the data does ask of the church is narrower and harder to refuse. If the change is made, the survey is close to unanimous on one point across every role and every theological camp: it cannot be made <em>well</em> without deliberate structures — protected avenues for women’s voices in the courts of the church, strengthened diaconal roles, clear reporting pathways that don’t depend on an all-male session, and genuine support (not merely assurances of safety) for the people most affected. Respondents who support the change and respondents who oppose it converged here. That convergence, like the role gradient, is a real finding — and it is the one the church can act on regardless of where it lands on the eldership question itself.</p>
<h2 class="wp-block-heading">The three questions, answered</h2>
<p class="wp-block-paragraph">Strip away the framing and there are three plain questions a reader actually wants answered. Here is what the data supports — and, as carefully, what it doesn’t.</p>
<h3 class="wp-block-heading">1. Are people concerned in ways that actually matter?</h3>
<p class="wp-block-paragraph">Yes — though not “statistically significant” in the p-value sense, because without published response counts nobody can compute that, and I won’t pretend otherwise. The significance that matters here is whether the concern is real and structured rather than noise, and it is. Among respondents it is the majority view in every non-minister column on the core question, around 47 to 57%. It holds across five differently-worded questions, which makes it hard to dismiss as isolated variation. It is shared right across the church — elders, deacons, members and paid staff all clustered high together — and only the ministers who answered sit apart from it. And it attaches to mechanisms rather than moods: reluctance to report, expected conflict, the loss of a safeguarding voice. Those are operational risks a board would minute, not feelings to be soothed.</p>
<p class="wp-block-paragraph">The floor matters as much as the ceiling. Set the highest figures aside entirely; the argument survives on the lowest. Across every cohort but ministers, somewhere between a quarter and three-fifths expect negative consequences — and even a quarter is too many to wave off. A church that takes safe ministry seriously doesn’t need a majority to warn it. It needs a credible minority, and it has one in every room. That holds whatever you make of the sampling, because it doesn’t lean on the big numbers being precise.</p>
<h3 class="wp-block-heading">2. Does the summary show what the data shows?</h3>
<p class="wp-block-paragraph">The conclusions are sound — and worth saying so plainly, because the committee’s report does good work: it sets out real options and calls for structures to be built. The question here is narrower, and it is about the <em>summary</em> of the data rather than the committee’s judgement. The safety headline is fairly reported and stable across church types. But a reader meets the reassuring number first and has to go looking for the harder one. The summary leads with <em>safe</em> and gives less room to the distinction respondents kept drawing between safe and <em>supported</em>. It foregrounds the aggregate, where the role gradient that carries the signal does not appear. And the Q14 reporting result — a plurality across several roles expecting people to become <em>less likely</em> to report conflict, bullying or harassment — sits below the headline rather than beside it, though it is just the kind of finding a work-health-and-safety process exists to surface. None of this is to fault the people who wrote it; summarising contested data under real pressure is hard, and a reassuring frame is an understandable place to land. It is only to say that the frame and the finding point in different directions, and the finding is the one a safety process most needs to see.</p>
<h3 class="wp-block-heading">3. Are the concerns founded?</h3>
<p class="wp-block-paragraph">As to cost and risk, yes. The committee notes that the strongest predictor of concern is whether a congregation <em>currently has female elders</em>, and that those churches are over-represented in the responses. That is the reason to read the raw aggregate with caution — and earlier I set out how a reader can correct for it, on the most hostile assumptions, and find the lean still holds. What remains after that test is the most telling thing in the data: the churches most worried about losing women from session are the ones that have women in session now. They are not reacting to an abstraction or a memory; they are living the arrangement the change would remove, and they feel its loss most sharply. That makes sense, and it is about as close to evidence as a survey of anticipated effects can get. What the data does <em>not</em> establish is that harm is inevitable: this measures expectation, not outcome, and the committee is right that good transition structures could mitigate much of it. But respondents on <em>both</em> sides agreed those structures would be needed. The concern is evidenced; the harm is contingent; and what it is contingent on is whether the structure gets built.</p>
<p class="is-style-info wp-block-paragraph"><strong>What would sharpen this — and what only the committee holds</strong><br>The survey’s own charts give us a good deal: the split of respondents by role, and by gender within each role, both of which I’ve used above. Two things would let a reader go further. First, exact respondent counts — I’ve read the bases off the demographic graph, so they are approximate, and the committee holds the precise figures. Second, and more useful, a cross-tabulation of answers by gender: not just how many men and women responded, but how each answered each question. That is the one cut that would show whether, on any given question, a particular gender in a particular role was markedly more concerned — and it is not in anything released. Both sit with the committee. Releasing them would let anyone replace my approximations with the real numbers and test every claim here. That would serve the whole church, and it would be welcomed.</p>
<h2 class="wp-block-heading">From what the data shows, to what we must do</h2>
<p class="wp-block-paragraph">Everything to here has been analysis, and analysis only goes so far. It can tell us what the people who answered expect, how consistently they expect it, and where the weight falls. It cannot tell us what to do about it. For that a church doesn’t turn to a survey, or to the Work Health and Safety Act, or to a committee’s framing. It turns to Scripture. So the rest of this changes its footing on purpose. We have read the data carefully and held it loosely where it was thin. Now comes the harder question — what faithfulness asks of us — and the church has to answer that as the church.</p>
<p class="wp-block-paragraph">Make the change to eldership if that is what the church, under God’s Word, discerns it must do. I have never contested that, and I do not contest it now. But the data — broad, consistent, leaning one way across every cohort but the ministers who answered — describes a cost that respondents across the church expect, and one they expect to fall hardest on women, on paid staff, and on the congregations that have actually lived the arrangement now being removed. Deciding in obedience is one thing. Doing it well is another, and that depends on whether we build what the change requires. Building it is obedience too.</p>
<p class="wp-block-paragraph">For we are not first a workplace discharging a statutory duty. We are a people under a Lord who commands the protection of the vulnerable and the restraint of those who shepherd. “Speak up for those who cannot speak for themselves,” says Proverbs; “defend the rights of the poor and needy.” Ezekiel’s indictment falls on shepherds who ruled the flock with force and did not bind up the injured. Peter charges the elders to shepherd the flock “not domineering over those in your charge, but being examples.” The Westminster divines, reading the sixth commandment, named among its plain duties the “comforting and succouring the distressed, and protecting and defending the innocent.” None of this is borrowed from the culture. It is older than the WHS Act by millennia, and it asks more of us than the Act ever could. Where the best secular standard now says a workplace must make it safe to speak up, it is, belatedly, agreeing with our own book.</p>
<p class="wp-block-paragraph">And it is easy to miss that Presbyterian government is <em>itself</em> a piece of safeguarding. The plurality of elders, the graded courts that hold one another to account — we have these because the tradition recognised long ago that fallen leaders need accountability, and that no one should hold unchecked power over another soul. Narrow who sits in those courts without strengthening the ways the vulnerable reach them, and you weaken the very protection the system was built to give. The answer isn’t to reopen the question of the courts. It is to stay consistent with the logic that gave us courts at all.</p>
<p class="wp-block-paragraph">What faithfulness asks for here is concrete, and it is worth being precise about, because some will rightly say the denomination already has external mechanisms — the Conduct Protocol Unit, Breaking the Silence, channels that reach past the local session. That is true, and to the church’s credit. But three things must be said about them. The first is that the survey is not measuring whether a pathway exists; it is measuring whether people believe they would be <em>heard</em> — whether someone who is hurt trusts they will be listened to, believed, and not made to pay for having spoken. The second is that these external channels are a backstop, and a backstop that, in practice, often routes a matter back to the session to be dealt with. So they do not let the session off the hook; they lean on it. The third is that, for much of what actually wounds people, the formal machinery is the wrong size and the wrong shape. It is built for the serious and the broken-down. Most of what hurts people in a church is smaller and more ordinary than that — the concern brushed aside, the decision made without them, the sense of not being heard — and reaching for a denominational protocol unit over it can feel too heavy, or simply too alien, especially in smaller or more informal congregations. So it goes unused, and the hurt goes unspoken.</p>
<p class="wp-block-paragraph">Which means the real ask is not a better way <em>around</em> the session. It is a session people do not need to go around. What we should want — what Scripture asks of elders directly — is that the men who sit in session be loving, kind, patient, and like Christ: shepherds who do not lord it over those in their charge, who bind up the injured rather than rule with force. We should want local congregations to be places where someone who has been hurt is listened to and cared for by their own church, where the ordinary wound is tended before it festers, and where the external supports can sit there, available and trusted, precisely because they are rarely needed. That is a far harder thing to build than a reporting form. It is the work of character, not procedure — and it is exactly the work the office of elder is for. The safeguarding the survey asks for and the eldership Scripture describes turn out to be the same thing: elders people are not afraid to come to.</p>
<p class="wp-block-paragraph">Around that sit the supports that make it real. Access for women, as of right rather than at an elder’s discretion, to the bodies that handle pastoral care, conflict and discipline. Newer diaconal roles encouraged, taken up, and given time to establish, rather than left as a provision on paper. Genuine support for those most affected, not just the reassurance that most people already feel safe. None of this touches the theology of eldership. All of it is what the data says is needed, and what Scripture says is right — and for a church those are one thing, not two.</p>
<p class="wp-block-paragraph">And none of this is a fringe demand. The committee running the consultation, setting out the paths open to the Assembly, names exactly this one: to link any change to formal ways of including women’s voices, so that the current standard of safety is maintained or improved. They observe it is the option that would best fit what respondents asked for, were a change to be made — while rightly noting that asking for structure alongside a change is not the same as asking for the change. That is the path this reading points to as well. Build the structure into the change, not after it. The body closest to the data has put the same option on the table.</p>
<p class="wp-block-paragraph">None of this is new. When the results of this survey did not yet exist — when all we had were the questions it asked and the evidence that bore on them — I set out what I thought would need to be true before this change could responsibly be made: that a real mechanism for women’s voices in the courts of the church be built and tested <em>before</em> the door closes, not promised afterward; that our safeguarding be shown to work rather than merely assumed to; that there be a way to raise a concern that does not depend on the goodwill of the very people it might be about; that the church know which path a given hurt is meant to travel; and that the commitments we have already made to honour men and women alike be carried out, not left on paper. I made that case from governance research, from public inquiries into institutional harm, from the experience of churches that had already walked this road — and I was careful to call such evidence suggestive rather than proof, because perhaps we were different. That was the honest caveat. The survey has now answered it. What we know now is that the external evidence was never only external. The pattern the research described, our own people have described back to us — in their own pews, in their own words. We are not the exception the caveat left room for. The outside was a picture of the inside all along.</p>
<p class="wp-block-paragraph">And I must say plainly that more has been done than I knew when I first wrote, and that it is good. At presbytery level there is now a compulsory female representative for safe-ministry, and a compulsory female support person for anyone who makes a complaint; presbyteries have been urged to keep a list of women able to support complainants, and there is provision for female presbytery affiliates. At Assembly, deaconesses and key women have been given rights described as equivalent to an associate member, a 2025 overture would extend that to women serving on committees, a committee is examining bullying at the local-church level, and the women’s engagement committee has brought overtures to widen women’s standing further still. At the session level, churches are being encouraged to establish deacons’ courts, to invite women to attend session, and to appoint both men and women as safe-ministry representatives. None of this is nothing. The denomination is not standing still, and the direction is right. I’m glad of it, and I won’t take any of it back.</p>
<p class="wp-block-paragraph">And it is not only the formal measures. In many churches, ministers and sessions have taken this conversation seriously and handled it well — listening carefully, leading their congregations through hard ground with patience, making space for women’s voices long before any rule required it. Their congregations and communities have noticed, and have thanked them for it. That work is real, and good, and I don’t want a word of what follows to be heard as discounting it. If anything, it is the proof of the point: these are the churches showing what it looks like done well. The question the survey raises is not whether it <em>can</em> be done well — plainly it can, and is — but whether the people most affected can count on it being done well everywhere, by whoever happens to be in the chair, rather than where the right person happens to be sitting in it.</p>
<p class="wp-block-paragraph">Because hold it up to the light and two things show through. The first is that almost none of it is yet <em>finished</em>, and much of what exists grants a voice without a vote. Some is encouragement a church may take or leave. Much of the rest gives rights “equivalent to an associate member” — and that phrasing is worth understanding, not as a slight but as a mechanism. An associate member is someone who already holds a full vote in another court of the church and is recognised, and seated, when visiting this one. The status is dignified precisely because the vote lives at home. To grant women rights <em>equivalent</em> to that is to extend the recognition — a real and good thing — but there is no home court underneath it, no vote anywhere for the equivalence to rest on. It copies what an associate looks like in the room, not what an associate <em>is</em>. The opportunity to speak is a genuine gift. The opportunity to have a say is a greater one, and these measures, real as they are, stop at the first. A good deal more, too, is still proposed rather than enacted — real intent that has not yet reached the Code where it would become something a person could rely on as of right. Being in motion to a destination is not yet having arrived. The committee itself, weighing the options open to the Assembly, makes a kindred observation: that across twenty years or more of this debate, only a very small change has so far been adopted — the granting of associate-equivalent rights to deaconesses and committee members. That is a candid measure of the distance between motion and arrival, set down by the body running the consultation.</p>
<p class="wp-block-paragraph">The second is less obvious, but perhaps more telling. I showed an earlier draft of this piece to a friend, and they were able to gather these efforts together for me. Echoes of them had surfaced in earlier conversations — but this was the first time anyone had put a concrete list in front of me. And it is not that I hadn’t looked, or hadn’t asked. I am not standing at a distance from this process. If it takes someone assembling a bespoke list before an engaged member can see the progress whole, then the question is no longer only whether these things exist. It is how known they are, and how trusted, by the people they are meant to protect. A safeguard that the committee that built it understands, and the person in the pew has never heard of, is not yet doing the work of making anyone feel safe to speak. That is the same gap the survey found, one court higher: the distance between a thing that exists and a thing that is known, trusted, and reached for when it is needed. The progress is real. It is also, so far, strongest where the hurt is most distant, and least settled exactly at the session, where the survey says the hurt is closest.</p>
<p class="wp-block-paragraph">I want to be honest about the limits of my own judgement. I don’t know whether what has been done is enough — whether it truly meets the concerns the survey surfaced, or only gestures toward them. It may be that the work is sound and only its visibility is lacking; if so, that is good news, and I will gladly be wrong. But I don’t think we are there yet, and I’ll say why, because the reason is one anyone can weigh. A church is being asked to trust that its courts, and the culture around them, are healthy enough to hear the vulnerable and treat them well. The clearest evidence we have about that culture is the way this very debate is being conducted — its tone, and the calibre of the discourse around it, in our courts and online. And that conduct does not yet read like a culture that has arrived. A conversation about caring for the vulnerable that is not always carried on with care is itself the answer to whether the structures can simply be trusted to work. I hold that as a sense, not a proof. But it is a sense with public evidence behind it, and it tells me there is still work to do.</p>
<p class="wp-block-paragraph">I don’t have a vote in this. A good many of us don’t. But a vote and a voice aren’t the same thing. A church that remembers the difference, and makes room for the voice of those who have no vote, is only being what Presbyterians have always said the church is — a body where the weakest members are indispensable, and where no one is heard only when someone else allows it. Make the change, if it must be made. But build the structure first. The survey has told us, in the church’s own words, what its people expect it to cost if we don’t. Scripture told us long before the survey did.</p>
<p class="is-style-info wp-block-paragraph"><strong>Notes on method and sources.</strong> (1) All figures here are partial returns, not absolute totals — the survey sampled attenders, and the denominational counts are built only from congregations that reported. Percentages should be read as “of those who answered,” not as population proportions. Approximate respondent bases, read off the survey’s demographic chart: lay (member/attender/volunteer) ~1,570; elders ~110; ministers/PW/CGW ~50; paid staff ~27; deacons ~27. Gender split, read off the same chart and cross-checked against the ~56% female headline: overall ~56% F / ~42% M; lay ~60/40; paid staff ~52/48; deacon ~64% F; elder ~20% F; minister ~9% F. No published cross-tabulation of answers by gender exists, so no claim is made about how either gender answered any individual question. At a base of ~27 the 95% margin on a single proportion is roughly ±19 points (so a 47% reads as a 28–66% band); at ~110 it is about ±9 points, and at ~1,570 about ±2–3 points. That is why the paid-staff and deacon columns are read for direction (which side of the line the band falls) rather than for their point value, while the elder and lay columns can be taken close to face value. Survey percentages are from the WHS consultation data document, Q8–Q15; the denomination’s own current counts, from the PCNSW 2026 Snapshot to the General Assembly, run to roughly 700 elders (~12% female), 178 ordained ministers/ministry workers, 26 CGWs, 8 deaconesses, 311 central-payroll staff (2026, up from 292 and 262 in the two prior years), ~13,500 adult attenders within a total average attendance of 18,088, 183 pastoral charges, and a real vacancy of about 19% (three-quarters of un-appointed vacancies regional). The estimate of around eighty female elders is 12% of the ~700 total; the firmly-counted figure within the 144 reporting congregations is about 66. (2) The minister low-concern reading is offered as a divergence in how the cohorts read the question — some ministers framing the change as a matter of obedience rather than cost — not as a claim of minister under-representation, nor as an imputation of motive. (3) Elder figures rest on ordinary sample strength (a large office, an unremarkable sample), not on any finite-population or census claim. (4) The deacons’-court column rests on about 27 self-identified respondents — a small base, read for direction and consistency with the gradient only; no precise-value, size, or representativeness claim is made for it. (5) Net-negative figures for Q9/Q11 sum the “some” and “significant” negative bands; these additions have been checked against the survey tables (e.g. paid-staff Q9 = 28% + 28% = 56%; minister Q11 positive 25% / negative 20%). (6) The 1,787 / 117-charge figures are from the survey results document itself. (7) The reporting claim is framed to the Safe Work Australia national model code of practice on managing psychosocial hazards (2022), which treats under-reporting of bullying/harassment as a marker of poor culture and a barrier to hazard identification; the approved NSW code (2021) does not yet use this wording, and the text says so. The claim is deliberately not pinned to the NSW Regulation, which does not name reporting behaviour. (8) The central conclusion — a consistent, directional, cross-question gradient with ministers lowest and every other cohort elevated — is a property of the responses and does not depend on any population denominator; the partial-returns caveat does not weaken it. (9) The committee’s summary of the results, its four risk-management paths (including the option of linking any change to formal structures for women’s voices), and its observation that twenty-plus years have produced only associate-equivalent rights reflect the Elders Consultation Special Committee’s reporting to the Assembly (Ben Greig, Convener). Where the committee’s earlier report and the final survey results differ, the final results are treated as authoritative. The committee’s note that responses were not drastically different by gender, age or role is consistent with the reading here: most roles cluster together, and the analysis isolates the single cohort — ministers — that sits apart.</p>
<p class="is-style-info wp-block-paragraph">Some of this analysis was prepared with the assistance of AI tools, used for research, data-checking, and drafting. The argument, the judgements, and the position it reaches are mine, and I take responsibility for them.</p>
Earlier, I wrote about the case for building the door — before we change the lock — and this rested on what we could learn from other churches and secular wisdom. The data from the consultation survey lets us stop guessing and look: it confirms the door has to exist. This is an argument about impact, not the theology of eldership. Make the change if it must be made — but Scripture, and our own tradition, ask us to build what it requires first.
Let me say at the outset what this is not. It is not a case for or against male-only eldership. Someone can be a convinced complementarian, sure the change is biblically right, and still owe an honest account of what it will actually do. Deciding whether to act and seeing clearly what acting will cost are different things, and the second doesn’t go away once the first is settled. The survey was never a referendum on the theology, and neither is this. It asks one question: if the change is made, what does the denomination’s own data say will happen to its churches — and is it ready for that?
The committee’s report opens with a reassurance: 91% feel safe always or most of the time, steady whether or not a congregation has women elders. That is good news, and we should want it to be true. It is also worth being clear about which question it answers. Nobody proposed this change expecting people to feel unsafe the next morning; how safe people feel on an ordinary Sunday was never really what was in dispute. The worry raised about the change was structural — what avenues exist, who gets heard, whether someone would speak up when something goes wrong. That kind of risk doesn’t show up in a question about present-day safety. It shows up in what people say they would do: whether they would report, whether they feel supported, what the churches that have actually lived this arrangement say about it. The safety headline is true and reassuring. The questions closer to the actual concern sit a row or two down in the same tables, and they read less reassuringly — which is why the rest of this looks there.
First, how big is this response?
Before reading the signal, it’s worth knowing whether 1,787 voices is a lot or a little — because both cheap dismissals are wrong, and naming the real base disarms them. NSW has roughly 153,000 people who tick “Presbyterian and Reformed” on the census. But census identity is a cultural ghost, not a churchgoer. The PCNSW is a community of about 35,000; of those, around 14,000 adults are in average attendance across 183 congregations. Those are the people a survey about church life can actually reach.
What 1,787 is measured against
Figure
1,787 is…
NSW census “Presbyterian and Reformed” (2021)
~153,000
~1% — the affiliation ghost, not the church
PCNSW “community” (self-described)
~35,000
~5%
Average adult attenders
~14,000
~13% — roughly one in eight
Congregations represented
117 of 184
~64% of all charges
Census figure from ABS 2021; attendance and congregation counts are the denomination’s own current figures. Measured against the people who actually attend, this is a substantial, denomination-wide response — not a fringe, and not “barely 1%.”
So set both dismissals aside. This is not a vocal minority — one in eight regular attenders, across nearly two-thirds of congregations, is a serious response by any church-survey standard. Nor is it a census; at 13% of attenders it is exactly the size where who answered decides what the numbers mean. That is the hinge the rest of this turns on.
One caveat sits over everything below. Every figure here — the survey responses, and the denomination’s own counts of elders, attenders and vacancies — comes from partial returns. The survey heard from some attenders, not all. The denominational counts come from the congregations that reported, not every charge. And like every voluntary consultation, this one is subject to self-selection: those with strong views, either way, are likelier to answer. That doesn’t make the sample random. But two things make a small mobilised faction an unconvincing explanation for what it found. The first is sheer volume: around one in eight adult attenders responded, a high rate by any church-survey standard, and a fringe is rarely that large. The second is internal consistency — the same ordering across differently-worded questions, which a simple over-representation of the agitated on one side would not produce. Neither makes the result a population proportion, and I don’t read it as one; together they make it hard to dismiss as noise. So I’m reading what the respondents said, not pronouncing on settled populations — and the reading doesn’t need a precision the returns can’t give.
The survey’s own demographic chart shows roughly how the 1,787 split by role. Reading it off the page gives approximately:
Who answered (approximate)
Share
~Respondents
Members, regular attenders, volunteers
~88%
~1,570
Elders
~6%
~110
Ministers / pastoral workers / CGW
~3%
~50
Paid staff (incl. pastoral assistants)
~1.5%
~27
Deacons
~1.5%
~27
Read off the survey’s demographic graph, so approximate. The lay cohort is the bulk of the response; the office-bearer columns are smaller, and two of them are small enough to need care.
How to read these numbers before we start
A percentage from a small group is not a precise point; it is a band, and the width of the band comes straight from how many people answered. When only about 27 people are in a column, each one is a twenty-seventh of the total — one person in twenty-seven is nearly four points — so if just a handful had answered differently, the figure would swing several points at once. That movement is what makes the band wide: around nineteen points either way. Put more people in the column and each one counts for less. One in 110 elders is under a point; one in the 1,500-plus lay respondents is barely a twentieth of a point. So those figures hardly move, and the band tightens — to roughly nine points for elders, and just two or three across the lay column. Same survey, same logic: the more people answered, the less any one of them can shift the number, and the steadier it is.
Statisticians call that band a 95% confidence range. Read the survey this way and the true figure should fall inside the band about nineteen times in twenty. So “47% of paid staff expect leadership to deteriorate” really means “somewhere in the region of 28% to 66%” — a wide band, and an honest one. That isn’t a flaw in the survey; it is just what small samples do, and once you read the figure as a band rather than a number, it behaves honestly.
Here is the part that matters. The useful question is not “what is the exact figure?” but “which side of the line does the whole band sit on?” Take that same paid-staff figure. Even at the very bottom of its band — 28% — it still outweighs the 17% who expected leadership to improve. The band is wide, but it sits on the same side of the line from end to end. That is what a small sample can honestly tell you: not the precise number, but the direction, when the direction holds right across the margin.
So read the small columns — paid staff and deacons, about 27 each — for which way they lean, not for their decimals. The precision lives in the big columns: over 1,500 lay respondents and around 110 elders, where the figures can be taken close to face value. Nothing below asks a 27-person column to carry a number on its own. It asks only whether each column leans the same way as the large ones — and for that, the small samples are entirely good enough.
One more thing worth reading off the survey’s charts before the questions: who answered, by gender. The response was not heavily distorted by either gender — overall it ran about 56% female to 42% male, and the columns that carry most of the concern are mixed, the lay respondents around 60/40 and the paid staff about 52/48. Both men and women cared enough about this to answer, which means the sample is a reasonably representative cross-section of the church rather than a single-gender bloc. That matters for how you read everything below: a negative result here is not simply one group, unhappy at losing something, speaking for itself. One limit to be honest about — there is no published breakdown of answers by gender, so we can’t see whether, on a given question, one gender in a particular role was markedly more concerned than another. That cut would be worth having. But even without it, the balance of who responded tells us this is not the voice of one gender alone.
The committee’s own summary of the results notes that the responses were not drastically different by gender, age or role, and that the sharpest variation was whether a congregation currently has female elders. That is fair, and worth holding alongside what follows. The point below is not that the roles split wildly from one another — on most questions they don’t. It is narrower: one cohort, the ministers, sits apart from all the others, who cluster together. Reading for that single break is not in tension with the committee’s summary; it is a closer look at the one place the otherwise broad agreement parts.
For orientation on the offices themselves: the denomination’s 2026 snapshot records around 700 elders (about 12% female), 178 ordained ministers and ministry workers plus 26 commissioned gospel workers, and 8 deaconesses. The office of deacon isn’t given a separate figure in the published data, which is why I read the deacons’-court column for direction rather than against any office total. So the ~50 ministers who answered are about a quarter of their office; the ~110 elders a useful slice of theirs; and the lay column simply enormous. Hold those rough sizes in mind — they shape what each survey column can and can’t tell us.
The pattern the pie charts wash out
Take Q8 — whether excluding women from eldership would affect the quality of leadership. Read by role, one cohort stands apart from all the others. Ministers expect deterioration at 25%; everyone else — paid staff, elders, deacons, members — sits between 47% and 57%. The elders are not the unconcerned ones here: a clear majority of them, 54%, expect the quality of leadership to fall. So do the deacons in their pastoral role, and the members in the pews. It is the ministers, and the ministers alone, who break away from the rest:
Q8 — “Will affect leadership quality”
Minister/ PW/CGW
Paid Staff
Elder
Deacon
Member/ Adherent
Yes, positively (will improve)
27%
17%
14%
3%
12%
Yes, negatively (will deteriorate)
25%
47%
54%
57%
56%
No change
45%
28%
29%
29%
23%
Unsure
2%
8%
4%
4%
10%
Ministers are the outlier on the low end; every other role clusters high. The gradient — not the average — is the finding.
This is the single most important shape in the data, and the aggregate pie chart erases it. The deacons’-court column sits at the top, and the four non-minister columns move together — a single ordering that the “All Respondents” pie cannot show.
And it isn’t one question. The same ordering recurs across the survey, which is what turns a curiosity into a finding:
“Negative / increase / less likely” by role
Minister/ PW/CGW
Paid Staff
Elder
Deacon
Member/ Adherent
Q9 — significant negative culture impact
22%
28%
37%
53%
43%
Q10 — yes, impacts relationships/teams
49%
61%
61%
74%
68%
Q11 — significant negative wellbeing impact
11%
17%
27%
33%
24%
Q13 — increase in conflict/bullying/harassment
24%
24%
40%
47%
31%
Q14 — staff/volunteers less likely to report
20%
31%
33%
53%
41%
Five differently-worded questions. The same role ordering each time: ministers lowest, deacons highest, everyone else stacked in between.
Why the pattern is the finding, not the noise The same cohort ordering turns up across five differently-worded questions. A one-off result might be a quirk of a small sample; the same ordering recurring five times over is much harder to put down to isolated variation. When deacons, elders and members all point in the same direction on leadership, culture, relationships, wellbeing, conflict and reporting, while ministers sit lowest each time, that consistency is the evidence on its own — whatever the precision of any single percentage. One figure might sit anywhere within a margin. A pattern that holds across five differently-worded questions is not so easily explained away.
Reading honestly in both directions
First, a discipline worth holding to, since the argument only earns its weight by observing it: not every question is a warning. Some results are genuinely divided, and a divided result isn’t a negative one. Where about as many expect improvement as expect harm, the honest word is “split,” and it would be wrong to count the negative half while setting the positive half aside. So here is each question sorted by what it actually shows. Naming the split ones as split is what makes it fair to name the lopsided ones as lopsided.
Question
Positive
Negative
Reads as
Q8 — leadership quality (non-minister roles)
3–17%
47–57%
Lopsided — negative ~3× positive
Q9 — church culture (non-minister roles)
14–22%
56–70%
Skewed negative
Q11 — personal wellbeing (ministers)
25%
20%
Genuinely split — near wash
Q13 — conflict / bullying / harassment
3–11%
24–47%
Lopsided toward increase
Q14 — willingness to report
3–9%
20–53%
Strongly asymmetric
Granting the washes doesn’t weaken the case; it clarifies it. Ministers’ own wellbeing answer (Q11) is close to even — said plainly, because it is true. The reporting and conflict questions are not even close.
That is the thing to hold onto. The argument doesn’t need every number to lean one way, and several don’t. It rests on the questions that bear on structure and safety — reporting, conflict, the loss of a safeguarding voice — and those are the ones that aren’t washes. A balanced question elsewhere doesn’t settle those, because the balanced questions are not the ones carrying the weight.
What each role’s numbers actually carry
Start with the ministers, because they sit at the low end of the gradient and it would be easy — and wrong — to wave them away. The survey’s ministry-workers column — about 178 ordained ministers and ministry workers in the denomination — drew around fifty responses, about a quarter of that office. That is a fair slice, not a thin or self-selected sliver. So the low minister concern is real, decently sampled, and sincerely held, and I am not going to explain it away. What makes the ministers worth pausing on isn’t that they are unrepresentative. It is that they are reading this change very differently from everyone else who answered — and they sit, with the elders, among those who will vote on it. That is the thread the next section picks up.
The elders sit on firmer ground than the small office columns, and for a simple reason: more of them answered. Around 110 elders responded — far more than the ministers, deacons or paid staff — and that larger sample tightens the band from the ±19 of a 27-person column to something nearer ±9. So where the small columns can be trusted only for direction, the elder figure can be read closer to its face value. Of the elders who answered, around 54% expect leadership quality to deteriorate. No special pleading is needed for that one. It is the most ordinary number in the whole exercise — a clear majority of a decent sample, read at something like its actual value rather than as a wide band — and it leans the same way as the rest.
Deacons need care for one straightforward reason: the size of the sample. About 27 people identifying with the deacons’ court answered — a small base, read as a band, not a figure. That is enough to read which way the column leans, not enough to trust its decimal. And of those who answered, the lean is the same as everyone else’s: around 57% expect things to get worse. Don’t take the 57% to the decimal, and don’t bin the column either. Read it for direction, and the direction puts it with the rest — in fact at the head of it.
Are we all answering the same question?
Here the data turns from broadly agreed to genuinely striking — though not in the way you might expect. The ministers who answered are the outlier, lowest concern on every question, while elders, deacons, members and paid staff all read it as carrying a real cost. That gap is remarkable precisely because it is so disjointed: the people who lead these churches and the people who fill and serve them are looking at the same proposed change and reporting very different things about what it will do.
It is worth asking why, and one possible explanation is worth offering — tentatively, because it concerns how people read a question rather than anything the survey directly measured. Some who support the move to male-only eldership see it as a matter of critical biblical obedience: not a question of cost and benefit at all, but of faithfulness to Scripture. Read that way, the impact question may land differently — a thing one believes is commanded is a thing the mind is less inclined to weigh a price against. Offered as a possibility, not a diagnosis, and not of any individual, it would explain the disjoint without anyone being foolish or self-serving: one group may be answering a question about obedience while the others answer a question about cost.
And obedience is good; you do not refuse to obey because obedience is expensive. If those who support the overture are right that this is commanded, proceeding despite the cost may be exactly faithfulness, and I am not claiming to have settled that. But counting the cost and obeying anyway is not the same as obeying and seeing no cost — and we may not all agree which of those faithfulness requires. The conviction that something is right and the observation that it will be costly are not rivals; a church can hold both, and most of the survey’s respondents do.
This matters for the decision, though not in the way it would if ministers simply outvoted everyone. They don’t. At Assembly the floor is, in principle, shared: each charge is represented by a minister and an elder, two voices of equal standing in the vote. So the room that decides is not the low-concern reading writ large. It is split down the middle — and the elder half of it is among the most concerned in the whole survey, with a clear majority expecting the quality of leadership to fall. The disjoint the data found across the church runs straight through the body that will vote on it.
That is the thing worth sitting with. The elders who see a cost are not petitioners outside the room; they are full voters inside it, with every right to carry what they have seen into the decision. And the ministers, who are half that voice, have reason to listen hard to the other half — because the people who would absorb this change are telling them, in the church’s own consultation, that they expect it to be felt. The survey gave everyone an equal say; the floor gives ministers and elders an equal vote. On the evidence here, both halves of that floor have cause to take the concern seriously, and the elders in particular have the standing to act on it.
Paid staff: the people no one disputes are covered
It is worth pulling paid workers out and looking at them alone, and not out of sentiment. The Work Health and Safety duty reaches further than paid staff: in law a “worker” includes volunteers, so a great many more people in the life of a church are covered than the payroll alone. Some will argue the church shouldn’t have to count volunteers — but that is an argument with the legislation, not with this piece, and I don’t need to win it. Set the whole question of volunteers aside. Even a reader who thinks the survey cast too broad a net by asking the entire church still has to reckon with the paid staff, because about them there is no dispute at all: they are employees, and there are a lot of them — 311 paid through central payroll in 2026, up from 292 the year before and 262 the year before that. Whatever your theology of eldership, they sit inside the WHS framework this consultation was convened to address. An elder’s discomfort is a governance matter. A paid worker’s psychosocial hazard is a duty owed by a person conducting a business or undertaking. The survey gives them their own column, and it repays reading.
Paid Staff — selected responses
Figure
Reads as
Q8 — change will deteriorate leadership
47%
Near half
Q9 — negative culture impact
56%
28% “some” + 28% “significant”
Q10 — yes, impacts relationships/teams
61%
Clear majority
Q11 — some/significant negative wellbeing impact
37%
20% + 17%
Q14 — less likely to report issues
31%
Nearly a third
Paid staff also reported the highest “always feel safe” figure (72%) — safety and support are not the same thing, a distinction many respondents drew explicitly.
And paid staff are not the cohort with the governance vote, nor mostly the female-elder congregations the report flags as over-represented. They are the people who turn up to work in these churches. Their column is small — about 27 answered — so read its figures as bands, not points, and let the legal principle carry the weight rather than any single percentage. The principle doesn’t need a large sample: paid staff are employees, and the duty is owed whether 27 of them answered or 270. When a column like theirs leans, as it does here, toward expecting people to be less likely to report conflict, bullying or harassment, that lean matters. The national model code of practice on managing psychosocial hazards — the more recent of the two, even if NSW hasn’t yet adopted its wording — recognises that this kind of conduct is routinely under-reported, out of fear of blame, escalation or reprisal, and that an absence of reports doesn’t mean nothing is wrong. It treats under-reporting as a marker of poor culture and a barrier to seeing hazards at all. The survey contained responses consistent with exactly that concern, from people the framework plainly covers — and they sit below the headline rather than beside it.
The finding that deserves the headline A consultation conducted under the banner of work health and safety — convened because of a safety concern — produced data in which, of those who answered, a plurality of several cohorts (around 53% of deacon respondents, 41% of members, 33% of elders, 31% of paid staff) expect people to become less willing to report conflict, bullying or harassment if the change is made, and that all-male sessions would be worse at seeing and handling it. In a church on the other side of a Royal Commission that examined exactly how unchallengeable authority and barriers to disclosure let harm continue, that is not a detail. Every safe-ministry structure the denomination has built assumes people will report. The summary leads with “91% feel safe,” which is true; but the reporting result is the one a safety process most needs to see, and it is worth bringing up to the light beside it.
Where the summary and the sharper reading part
The committee’s summary is accurate, and nothing in it is false. But a summary makes choices about what to foreground, and these choices lean a consistent direction — toward the reassuring reading, with the sharper edges set a little further back. Five places show it.
The hardest comments are summarised, not shown. The trauma and abuse responses at Q9 are withheld “to ensure such responses are not identified” — a fair privacy rationale, but the effect is that the most serious material is the least auditable. The same is true of the “don’t trust current male leadership” subset at Q8 and the female elders who said they would feel ruled “unbiblical/evil.” We are told these exist; we cannot weigh them.
Safe is quietly substituted for supported. The committee notes “in passing” that a large number of respondents marked themselves safe but not supported. That distinction is buried under the 91% headline, when it is arguably the more important of the two for a WHS frame.
The sample skew is acknowledged, and it is worth testing rather than waving past. The report notes that female-elder congregations are over-represented (20% of churches, 31% of responses) and that the strongest predictor of perceived risk is whether a church currently has women elders. That is a fair caution against reading the raw negatives at face value, and it deserves a real answer rather than a rhetorical one. So here is the test — illustrative rather than a formal weighted analysis, but enough to show which way the correction cuts. The over-representation is an excess of about 11 percentage points (31% against an expected 20%). Take the least favourable assumption: that the entire 11-point excess would vanish from a properly weighted sample, and that it fell uniformly on the negative side — the worst case for reading the data as concerning. Strip that weight out of the pooled figures and the leans still hold: a 54% or 57% drops toward the mid-to-high 40s — still a plurality, still elevated, still pointing the same way. The finding was never a bare majority clinging to the 50% line; it was a wide and consistent lean, and an 11-point adjustment on the most-exposed slice doesn’t erase it. And the correction touches only the pooled aggregate. The role gradient is computed within each office column — it is a property of which office a respondent holds, not which church-type they attend — so the over-representation doesn’t enter it at all. The skew affects the “All Respondents” pie charts; it leaves the gradient, where the real signal lives, untouched. Run the test the caution invites, and the reading holds.
The same skew cuts the other way, too. The female-elder churches answered at about one and a half times their share, and it is worth asking what that over-representation contains. These are the only respondents speaking from current experience of the arrangement the change would remove. Others write from past experience, or from watching other churches; these congregations have women in session today, and would feel the loss directly. So their responses may reasonably be read as particularly informative — not because they should carry extra statistical weight, but because they reflect lived experience of the very arrangement under discussion. The correction above already shows the leans survive even if their excess is stripped out entirely. What this adds is only that, set alongside the correction, their testimony is the survey’s closest thing to first-hand evidence about what removal would mean — and that is worth noticing, not discounting.
The aggregate is foregrounded; the role split is “texture.” Every flagship chart is “All Respondents.” The role tables are present but never assembled into the cross-question comparison that reveals the gradient. The structure of the document makes the finding hard to see.
What the change closes has a size, and it’s 12% today
Until now the place women hold in the courts of the church — the standing by which they presently sit in session as of right, rather than by an elder’s invitation — has been an abstraction for many of us in this debate. Not because the figure was unknowable; the numbers were there for anyone who went looking. It is that, like much else here, few had gone looking, and fewer still had put it in front of the people arguing the question. The official figures let us put a size on it. The denomination’s 2026 snapshot records about 12% of elders as female; against an estimated 700 elders that is on the order of eighty women — roughly one in eight — though the firmly-counted figure, from the congregations that actually returned their forms, is a little lower, around sixty-six. Call it somewhere between seventy and eighty-five, denomination-wide. Whatever the exact number within that band, it is small, and the proposed change grandfathers those women while closing the door to any more.
And the narrowing is not only a future event waiting on the vote. It has, in places, already begun. Female-elder churches are a minority to start with — about one congregation in five — and even among those, some will not appoint new women in anticipation of the change, while in some churches women who would serve as elders already cannot. None of that is necessarily wrong: if the theological frame is male-only eldership, this is simply that conviction being lived out ahead of the formal decision, and the people doing it are acting in good conscience. The point is only that we cannot discuss this standing as though it were intact and waiting. It is contracting now, by retirement and by choice, and every female elder lost — to relocation, to age, to a session that won’t replace her — is, under the proposed rule, unreplaceable. The number is small, and it has already started to fall.
That reframes what the female-elder churches are telling us. When those congregations say they would lose something real, we can now put a rough number on the something: somewhere around eighty women, about one in eight elders, with no way to replace them under the proposed rule and only one direction to travel. These are also the only respondents speaking from current experience of what is at stake — women in session now, not a memory of them — which is about as close to evidence as a survey of anticipated effects can get.
“Not enough men” was not hypothetical
Several respondents — ministers and members both — raised that in some congregations, particularly regional ones, the people suitable for eldership are sometimes women, and removing them would leave those churches short of leadership altogether. That concern is not idle. The denomination already carries real leadership-supply stress: of 183 pastoral charges, around 19% are genuinely vacant once session and CGW appointments are counted, and three-quarters of the un-appointed vacancies are regional. Whether those are the same churches the eldership change would touch is not something the figures tell us, and this piece does not claim it. But a denomination already short of available leaders, regionally, has less margin to absorb the loss of any eligible candidate — which is enough to make the “not enough men” concern a live structural risk rather than a hypothetical.
The church reading, kept honestly
None of the above settles the theological question, and it isn’t trying to. A great many respondents who expect a negative impact say so while affirming that the change may still be the right and biblical one — the elder who writes that obedience to Scripture and leadership quality are simply different axes is making a coherent point, not dodging. The survey was explicitly not a referendum on whether the change is correct, and this piece isn’t one either. A complementarian can hold that male-only eldership is commanded and that the data here describes real costs that fall hardest on women, on paid staff, and on the churches that will have to absorb the change. Those two things are not in tension; the survey is full of people holding both at once.
What the data does ask of the church is narrower and harder to refuse. If the change is made, the survey is close to unanimous on one point across every role and every theological camp: it cannot be made well without deliberate structures — protected avenues for women’s voices in the courts of the church, strengthened diaconal roles, clear reporting pathways that don’t depend on an all-male session, and genuine support (not merely assurances of safety) for the people most affected. Respondents who support the change and respondents who oppose it converged here. That convergence, like the role gradient, is a real finding — and it is the one the church can act on regardless of where it lands on the eldership question itself.
The three questions, answered
Strip away the framing and there are three plain questions a reader actually wants answered. Here is what the data supports — and, as carefully, what it doesn’t.
1. Are people concerned in ways that actually matter?
Yes — though not “statistically significant” in the p-value sense, because without published response counts nobody can compute that, and I won’t pretend otherwise. The significance that matters here is whether the concern is real and structured rather than noise, and it is. Among respondents it is the majority view in every non-minister column on the core question, around 47 to 57%. It holds across five differently-worded questions, which makes it hard to dismiss as isolated variation. It is shared right across the church — elders, deacons, members and paid staff all clustered high together — and only the ministers who answered sit apart from it. And it attaches to mechanisms rather than moods: reluctance to report, expected conflict, the loss of a safeguarding voice. Those are operational risks a board would minute, not feelings to be soothed.
The floor matters as much as the ceiling. Set the highest figures aside entirely; the argument survives on the lowest. Across every cohort but ministers, somewhere between a quarter and three-fifths expect negative consequences — and even a quarter is too many to wave off. A church that takes safe ministry seriously doesn’t need a majority to warn it. It needs a credible minority, and it has one in every room. That holds whatever you make of the sampling, because it doesn’t lean on the big numbers being precise.
2. Does the summary show what the data shows?
The conclusions are sound — and worth saying so plainly, because the committee’s report does good work: it sets out real options and calls for structures to be built. The question here is narrower, and it is about the summary of the data rather than the committee’s judgement. The safety headline is fairly reported and stable across church types. But a reader meets the reassuring number first and has to go looking for the harder one. The summary leads with safe and gives less room to the distinction respondents kept drawing between safe and supported. It foregrounds the aggregate, where the role gradient that carries the signal does not appear. And the Q14 reporting result — a plurality across several roles expecting people to become less likely to report conflict, bullying or harassment — sits below the headline rather than beside it, though it is just the kind of finding a work-health-and-safety process exists to surface. None of this is to fault the people who wrote it; summarising contested data under real pressure is hard, and a reassuring frame is an understandable place to land. It is only to say that the frame and the finding point in different directions, and the finding is the one a safety process most needs to see.
3. Are the concerns founded?
As to cost and risk, yes. The committee notes that the strongest predictor of concern is whether a congregation currently has female elders, and that those churches are over-represented in the responses. That is the reason to read the raw aggregate with caution — and earlier I set out how a reader can correct for it, on the most hostile assumptions, and find the lean still holds. What remains after that test is the most telling thing in the data: the churches most worried about losing women from session are the ones that have women in session now. They are not reacting to an abstraction or a memory; they are living the arrangement the change would remove, and they feel its loss most sharply. That makes sense, and it is about as close to evidence as a survey of anticipated effects can get. What the data does not establish is that harm is inevitable: this measures expectation, not outcome, and the committee is right that good transition structures could mitigate much of it. But respondents on both sides agreed those structures would be needed. The concern is evidenced; the harm is contingent; and what it is contingent on is whether the structure gets built.
What would sharpen this — and what only the committee holds The survey’s own charts give us a good deal: the split of respondents by role, and by gender within each role, both of which I’ve used above. Two things would let a reader go further. First, exact respondent counts — I’ve read the bases off the demographic graph, so they are approximate, and the committee holds the precise figures. Second, and more useful, a cross-tabulation of answers by gender: not just how many men and women responded, but how each answered each question. That is the one cut that would show whether, on any given question, a particular gender in a particular role was markedly more concerned — and it is not in anything released. Both sit with the committee. Releasing them would let anyone replace my approximations with the real numbers and test every claim here. That would serve the whole church, and it would be welcomed.
From what the data shows, to what we must do
Everything to here has been analysis, and analysis only goes so far. It can tell us what the people who answered expect, how consistently they expect it, and where the weight falls. It cannot tell us what to do about it. For that a church doesn’t turn to a survey, or to the Work Health and Safety Act, or to a committee’s framing. It turns to Scripture. So the rest of this changes its footing on purpose. We have read the data carefully and held it loosely where it was thin. Now comes the harder question — what faithfulness asks of us — and the church has to answer that as the church.
Make the change to eldership if that is what the church, under God’s Word, discerns it must do. I have never contested that, and I do not contest it now. But the data — broad, consistent, leaning one way across every cohort but the ministers who answered — describes a cost that respondents across the church expect, and one they expect to fall hardest on women, on paid staff, and on the congregations that have actually lived the arrangement now being removed. Deciding in obedience is one thing. Doing it well is another, and that depends on whether we build what the change requires. Building it is obedience too.
For we are not first a workplace discharging a statutory duty. We are a people under a Lord who commands the protection of the vulnerable and the restraint of those who shepherd. “Speak up for those who cannot speak for themselves,” says Proverbs; “defend the rights of the poor and needy.” Ezekiel’s indictment falls on shepherds who ruled the flock with force and did not bind up the injured. Peter charges the elders to shepherd the flock “not domineering over those in your charge, but being examples.” The Westminster divines, reading the sixth commandment, named among its plain duties the “comforting and succouring the distressed, and protecting and defending the innocent.” None of this is borrowed from the culture. It is older than the WHS Act by millennia, and it asks more of us than the Act ever could. Where the best secular standard now says a workplace must make it safe to speak up, it is, belatedly, agreeing with our own book.
And it is easy to miss that Presbyterian government is itself a piece of safeguarding. The plurality of elders, the graded courts that hold one another to account — we have these because the tradition recognised long ago that fallen leaders need accountability, and that no one should hold unchecked power over another soul. Narrow who sits in those courts without strengthening the ways the vulnerable reach them, and you weaken the very protection the system was built to give. The answer isn’t to reopen the question of the courts. It is to stay consistent with the logic that gave us courts at all.
What faithfulness asks for here is concrete, and it is worth being precise about, because some will rightly say the denomination already has external mechanisms — the Conduct Protocol Unit, Breaking the Silence, channels that reach past the local session. That is true, and to the church’s credit. But three things must be said about them. The first is that the survey is not measuring whether a pathway exists; it is measuring whether people believe they would be heard — whether someone who is hurt trusts they will be listened to, believed, and not made to pay for having spoken. The second is that these external channels are a backstop, and a backstop that, in practice, often routes a matter back to the session to be dealt with. So they do not let the session off the hook; they lean on it. The third is that, for much of what actually wounds people, the formal machinery is the wrong size and the wrong shape. It is built for the serious and the broken-down. Most of what hurts people in a church is smaller and more ordinary than that — the concern brushed aside, the decision made without them, the sense of not being heard — and reaching for a denominational protocol unit over it can feel too heavy, or simply too alien, especially in smaller or more informal congregations. So it goes unused, and the hurt goes unspoken.
Which means the real ask is not a better way around the session. It is a session people do not need to go around. What we should want — what Scripture asks of elders directly — is that the men who sit in session be loving, kind, patient, and like Christ: shepherds who do not lord it over those in their charge, who bind up the injured rather than rule with force. We should want local congregations to be places where someone who has been hurt is listened to and cared for by their own church, where the ordinary wound is tended before it festers, and where the external supports can sit there, available and trusted, precisely because they are rarely needed. That is a far harder thing to build than a reporting form. It is the work of character, not procedure — and it is exactly the work the office of elder is for. The safeguarding the survey asks for and the eldership Scripture describes turn out to be the same thing: elders people are not afraid to come to.
Around that sit the supports that make it real. Access for women, as of right rather than at an elder’s discretion, to the bodies that handle pastoral care, conflict and discipline. Newer diaconal roles encouraged, taken up, and given time to establish, rather than left as a provision on paper. Genuine support for those most affected, not just the reassurance that most people already feel safe. None of this touches the theology of eldership. All of it is what the data says is needed, and what Scripture says is right — and for a church those are one thing, not two.
And none of this is a fringe demand. The committee running the consultation, setting out the paths open to the Assembly, names exactly this one: to link any change to formal ways of including women’s voices, so that the current standard of safety is maintained or improved. They observe it is the option that would best fit what respondents asked for, were a change to be made — while rightly noting that asking for structure alongside a change is not the same as asking for the change. That is the path this reading points to as well. Build the structure into the change, not after it. The body closest to the data has put the same option on the table.
None of this is new. When the results of this survey did not yet exist — when all we had were the questions it asked and the evidence that bore on them — I set out what I thought would need to be true before this change could responsibly be made: that a real mechanism for women’s voices in the courts of the church be built and tested before the door closes, not promised afterward; that our safeguarding be shown to work rather than merely assumed to; that there be a way to raise a concern that does not depend on the goodwill of the very people it might be about; that the church know which path a given hurt is meant to travel; and that the commitments we have already made to honour men and women alike be carried out, not left on paper. I made that case from governance research, from public inquiries into institutional harm, from the experience of churches that had already walked this road — and I was careful to call such evidence suggestive rather than proof, because perhaps we were different. That was the honest caveat. The survey has now answered it. What we know now is that the external evidence was never only external. The pattern the research described, our own people have described back to us — in their own pews, in their own words. We are not the exception the caveat left room for. The outside was a picture of the inside all along.
And I must say plainly that more has been done than I knew when I first wrote, and that it is good. At presbytery level there is now a compulsory female representative for safe-ministry, and a compulsory female support person for anyone who makes a complaint; presbyteries have been urged to keep a list of women able to support complainants, and there is provision for female presbytery affiliates. At Assembly, deaconesses and key women have been given rights described as equivalent to an associate member, a 2025 overture would extend that to women serving on committees, a committee is examining bullying at the local-church level, and the women’s engagement committee has brought overtures to widen women’s standing further still. At the session level, churches are being encouraged to establish deacons’ courts, to invite women to attend session, and to appoint both men and women as safe-ministry representatives. None of this is nothing. The denomination is not standing still, and the direction is right. I’m glad of it, and I won’t take any of it back.
And it is not only the formal measures. In many churches, ministers and sessions have taken this conversation seriously and handled it well — listening carefully, leading their congregations through hard ground with patience, making space for women’s voices long before any rule required it. Their congregations and communities have noticed, and have thanked them for it. That work is real, and good, and I don’t want a word of what follows to be heard as discounting it. If anything, it is the proof of the point: these are the churches showing what it looks like done well. The question the survey raises is not whether it can be done well — plainly it can, and is — but whether the people most affected can count on it being done well everywhere, by whoever happens to be in the chair, rather than where the right person happens to be sitting in it.
Because hold it up to the light and two things show through. The first is that almost none of it is yet finished, and much of what exists grants a voice without a vote. Some is encouragement a church may take or leave. Much of the rest gives rights “equivalent to an associate member” — and that phrasing is worth understanding, not as a slight but as a mechanism. An associate member is someone who already holds a full vote in another court of the church and is recognised, and seated, when visiting this one. The status is dignified precisely because the vote lives at home. To grant women rights equivalent to that is to extend the recognition — a real and good thing — but there is no home court underneath it, no vote anywhere for the equivalence to rest on. It copies what an associate looks like in the room, not what an associate is. The opportunity to speak is a genuine gift. The opportunity to have a say is a greater one, and these measures, real as they are, stop at the first. A good deal more, too, is still proposed rather than enacted — real intent that has not yet reached the Code where it would become something a person could rely on as of right. Being in motion to a destination is not yet having arrived. The committee itself, weighing the options open to the Assembly, makes a kindred observation: that across twenty years or more of this debate, only a very small change has so far been adopted — the granting of associate-equivalent rights to deaconesses and committee members. That is a candid measure of the distance between motion and arrival, set down by the body running the consultation.
The second is less obvious, but perhaps more telling. I showed an earlier draft of this piece to a friend, and they were able to gather these efforts together for me. Echoes of them had surfaced in earlier conversations — but this was the first time anyone had put a concrete list in front of me. And it is not that I hadn’t looked, or hadn’t asked. I am not standing at a distance from this process. If it takes someone assembling a bespoke list before an engaged member can see the progress whole, then the question is no longer only whether these things exist. It is how known they are, and how trusted, by the people they are meant to protect. A safeguard that the committee that built it understands, and the person in the pew has never heard of, is not yet doing the work of making anyone feel safe to speak. That is the same gap the survey found, one court higher: the distance between a thing that exists and a thing that is known, trusted, and reached for when it is needed. The progress is real. It is also, so far, strongest where the hurt is most distant, and least settled exactly at the session, where the survey says the hurt is closest.
I want to be honest about the limits of my own judgement. I don’t know whether what has been done is enough — whether it truly meets the concerns the survey surfaced, or only gestures toward them. It may be that the work is sound and only its visibility is lacking; if so, that is good news, and I will gladly be wrong. But I don’t think we are there yet, and I’ll say why, because the reason is one anyone can weigh. A church is being asked to trust that its courts, and the culture around them, are healthy enough to hear the vulnerable and treat them well. The clearest evidence we have about that culture is the way this very debate is being conducted — its tone, and the calibre of the discourse around it, in our courts and online. And that conduct does not yet read like a culture that has arrived. A conversation about caring for the vulnerable that is not always carried on with care is itself the answer to whether the structures can simply be trusted to work. I hold that as a sense, not a proof. But it is a sense with public evidence behind it, and it tells me there is still work to do.
I don’t have a vote in this. A good many of us don’t. But a vote and a voice aren’t the same thing. A church that remembers the difference, and makes room for the voice of those who have no vote, is only being what Presbyterians have always said the church is — a body where the weakest members are indispensable, and where no one is heard only when someone else allows it. Make the change, if it must be made. But build the structure first. The survey has told us, in the church’s own words, what its people expect it to cost if we don’t. Scripture told us long before the survey did.
Notes on method and sources. (1) All figures here are partial returns, not absolute totals — the survey sampled attenders, and the denominational counts are built only from congregations that reported. Percentages should be read as “of those who answered,” not as population proportions. Approximate respondent bases, read off the survey’s demographic chart: lay (member/attender/volunteer) ~1,570; elders ~110; ministers/PW/CGW ~50; paid staff ~27; deacons ~27. Gender split, read off the same chart and cross-checked against the ~56% female headline: overall ~56% F / ~42% M; lay ~60/40; paid staff ~52/48; deacon ~64% F; elder ~20% F; minister ~9% F. No published cross-tabulation of answers by gender exists, so no claim is made about how either gender answered any individual question. At a base of ~27 the 95% margin on a single proportion is roughly ±19 points (so a 47% reads as a 28–66% band); at ~110 it is about ±9 points, and at ~1,570 about ±2–3 points. That is why the paid-staff and deacon columns are read for direction (which side of the line the band falls) rather than for their point value, while the elder and lay columns can be taken close to face value. Survey percentages are from the WHS consultation data document, Q8–Q15; the denomination’s own current counts, from the PCNSW 2026 Snapshot to the General Assembly, run to roughly 700 elders (~12% female), 178 ordained ministers/ministry workers, 26 CGWs, 8 deaconesses, 311 central-payroll staff (2026, up from 292 and 262 in the two prior years), ~13,500 adult attenders within a total average attendance of 18,088, 183 pastoral charges, and a real vacancy of about 19% (three-quarters of un-appointed vacancies regional). The estimate of around eighty female elders is 12% of the ~700 total; the firmly-counted figure within the 144 reporting congregations is about 66. (2) The minister low-concern reading is offered as a divergence in how the cohorts read the question — some ministers framing the change as a matter of obedience rather than cost — not as a claim of minister under-representation, nor as an imputation of motive. (3) Elder figures rest on ordinary sample strength (a large office, an unremarkable sample), not on any finite-population or census claim. (4) The deacons’-court column rests on about 27 self-identified respondents — a small base, read for direction and consistency with the gradient only; no precise-value, size, or representativeness claim is made for it. (5) Net-negative figures for Q9/Q11 sum the “some” and “significant” negative bands; these additions have been checked against the survey tables (e.g. paid-staff Q9 = 28% + 28% = 56%; minister Q11 positive 25% / negative 20%). (6) The 1,787 / 117-charge figures are from the survey results document itself. (7) The reporting claim is framed to the Safe Work Australia national model code of practice on managing psychosocial hazards (2022), which treats under-reporting of bullying/harassment as a marker of poor culture and a barrier to hazard identification; the approved NSW code (2021) does not yet use this wording, and the text says so. The claim is deliberately not pinned to the NSW Regulation, which does not name reporting behaviour. (8) The central conclusion — a consistent, directional, cross-question gradient with ministers lowest and every other cohort elevated — is a property of the responses and does not depend on any population denominator; the partial-returns caveat does not weaken it. (9) The committee’s summary of the results, its four risk-management paths (including the option of linking any change to formal structures for women’s voices), and its observation that twenty-plus years have produced only associate-equivalent rights reflect the Elders Consultation Special Committee’s reporting to the Assembly (Ben Greig, Convener). Where the committee’s earlier report and the final survey results differ, the final results are treated as authoritative. The committee’s note that responses were not drastically different by gender, age or role is consistent with the reading here: most roles cluster together, and the analysis isolates the single cohort — ministers — that sits apart.
Some of this analysis was prepared with the assistance of AI tools, used for research, data-checking, and drafting. The argument, the judgements, and the position it reaches are mine, and I take responsibility for them.
I didn’t realise at the time the way the words I had read would affect me. And it’s not necessarily their fault. I have pre-exisiting sensitivities to topics, situations and interactions where the engagement makes me feel lesser. That’s not always the intention or the other party. But some people don’t know how to play the ball, and instead play the player and the ball.
Sometimes the side step is not intentional. I don’t know this person. I don’t know what they think of me. I have seen them speak with passion and conviction, and believe that they are sincere and genuinely doing what they think is right and best.
But are they?
Sinner. Sufferer. Saved. We all are. Thank you Bern. Thank you Sylvia.
I was listening to an audio book last night. It reminded me of how I am made. How my mind works. How my first reaction is to see me as a sinner; wrong and not worthy of good. There is no condemnation for those in Christ Jesus.
And yet…
Who is he that condemns?
He is me. The part of my mind, my thoughts, my soul that internalises the view that I am exactly that; not worthy; not of worth; not of value; “not a burden to any of you”. Christ has died to redeem me; but I am still that fallen, sinful, wretched creature that I was.
The tension; the one that says that this can be true, and the other – that I am save, redeemed, holy, can also be true. But my heart? It doesn’t feel that truth.
So don’t trust your feelings. Don’t believe what your emotions tell you. Know that your faith is anchored in fact, and not the emotional winds that blow you around like a wave of the sea. Because if you do, your faith is weak; and you are double minded. Not worthy. Not of worth. But of course; that’s not what James means. But is it?
This sounds more broken; more empty; more worthless than I feel. And yet this part of my mind nags at me; and I sit with this tension.
I live in the tension between the now; and the not yet. I feel that tension and I love that tension. The sense of hope; that the everlasting light – pure and holy – will be present. Where fear is erased… but it must be in another place. But yet; I struggle with the now. I struggle to see that in the face of criticism, and the weight of accusation that perhaps I am the one who is wrong.
I struggle to see past the words that speak of my heart and intent; that assume wickedness and instability. I struggle to see past the words that speak to my innermost fears and that identity that sits below the surface.
I struggle with the fact that I will sit in a room with this person, and have no voice; no opportunity to speak; and no recourse. I struggle with the almost certain knowledge that my words will be weaponised against me and against those I love, and they will feel the need to defend me – and I will listen, and yet I will be without voice.
<p class="wp-block-paragraph">The PCNSW’s WHS consultation on the proposed change to restrict future eldership to men closes on 8th June (extended from the original 25 May). I have submitted my response. Many of you reading this also have — and many of you reached the survey, directly or indirectly, through something I wrote.</p>
<p class="wp-block-paragraph">I want to use this space to do three things. To thank you. To say where I’ve landed. And to be honest about what this season has cost, both for me and for the people who have walked through it.</p>
<h2 class="wp-block-heading">To those who engaged</h2>
<p class="wp-block-paragraph">If you responded to the survey because the companion guide helped you see why it mattered — or because someone you know read it and brought it to you — thank you. I am genuinely grateful. Many women in leadership roles across PCNSW churches told me directly that they only knew the survey existed because of something I wrote or shared. I know the actual reach has been much wider than that, because resources have a way of travelling further than their authors ever see. To everyone who engaged, in whatever form that took: thank you.</p>
<p class="wp-block-paragraph">I want to acknowledge something specifically. Many of you responded at real cost. You have told me that you wrote things in the free-text fields that took a long time to put into words, and that were difficult to commit to a survey form, even a confidential one. Some of you took the risk of being honest about your own experience of leadership in our churches, knowing that the data would be read by people you don’t know, and presented to an Assembly you may not feel knows or sees you. That took courage, and I do not think I have to tell you that. I think you know.</p>
<p class="wp-block-paragraph">Whatever the Committee does with the submissions, what you wrote matters. It will continue to matter after 25 May. You said the thing. The thing is now on the record. That is not a small thing.</p>
<h2 class="wp-block-heading">Where I have landed</h2>
<p class="wp-block-paragraph">The companion guide was an attempt to set out the evidence fairly. I think it succeeded at that. But the survey response I submitted was something different — it was an attempt to say what I actually think. And I want to say some of that here, because anyone who took the guide seriously deserves to know where its author ended up.</p>
<p class="wp-block-paragraph">I have landed in this place: we are already poorer as a denomination for how we treat women.</p>
<p class="wp-block-paragraph">Not because of the proposed change, which has not yet been made. But because of patterns that already exist in our churches — patterns I have seen with my own eyes, and patterns I have heard about from people I trust enough to listen to. I have heard from women who have brought serious things to their sessions and been told to get over them. I have heard from women whose ministry pathways have narrowed rather than widened, not through any single decision, but through a steady pattern of opportunity going elsewhere. I have heard from women who have been told their opinions don’t matter, sometimes in words and more often in the structural shape of how a meeting was run. I have heard from women who have had unwanted pursuit by men in our churches and not heard well when they raised concerns about it. It is not my place to share any of those stories in detail, because they are not mine to share. But they are real, and there are more of them than I wanted to see. The cumulative weight of them is something I am still sitting with.</p>
<p class="wp-block-paragraph">The academic literature describes more or less exactly what I have been hearing. The Royal Commission described it nearly a decade ago. Professor Tuckey’s research describes it. Our own PCA’s 2020 survey described it. We have not lacked the evidence. We have lacked the willingness to receive it.</p>
<p class="wp-block-paragraph">The proposed change does not create these problems. It formalises them. That is the frame I would want people to hold as they wait for the Committee’s report. The question is not “what risk does this change create?” The question is “what is already happening in our churches, and does this change help or does it make things worse?”</p>
<p class="wp-block-paragraph">My honest answer is that it makes them worse. I cannot see the mechanism by which closing the formal possibility of women on session improves women’s confidence to bring difficult things forward. I cannot see how it improves the quality of governance in sessions that are not currently well-governed — and as someone who sits on boards in business and not-for-profit contexts, I can tell you that most boards are not well-governed by default. They get there through the kind of scrutiny that comes from having voices in the room that ask the questions no one else will ask. I cannot see how the change improves our witness to people watching us from outside — and people are watching, more carefully than we sometimes notice.</p>
<p class="wp-block-paragraph">The Reformed tradition has always known that authority without accountability is dangerous. The Reformers did not consolidate authority and design accountability afterwards. They built the accountability first. If this change proceeds — and it may — it has to be done in that order. From the public material I have seen, we are not yet doing it in that order. The conditions that would need to be in place — binding alternative participation mechanisms for women, independent evaluation of our safeguarding training, complaint pathways trusted by the people who would need them — are not yet there. We are being asked to make the structural change without the structural supports. That is not how the Reformers would have built it, and I do not think it is how we should be building it now.</p>
<p class="wp-block-paragraph">I have had multiple conversations with my non-Christian friends and coworkers about the Church’s position on these, the debate and the overall way that their attitude to women is seen. I hope and pray I have been able to do some work to repair the perception they have of how our church loves the people who are part of it.</p>
<h2 class="wp-block-heading">On Breaking the Silence</h2>
<p class="wp-block-paragraph">I want to be clear about one thing, because it has shaped how I have thought about all of this. Breaking the Silence is excellent. It has helped me understand power and abuse and the responsibilities of leadership; it has given me a framework to process things I have needed to process; and I think it is one of the best resources of its kind in the Christian church anywhere. I am one of its strongest advocates.</p>
<p class="wp-block-paragraph">When I was assaulted last year, the framework I have – that started with the training Breaking the Silence gave me – was materially necessary in helping me understand the nature of what happened, my response to it, and the nature of abuse. It has helped me identify patterns and temptations in my own leadership and to materially change them; and to help make and shape decisions in areas where I am involved to choose to do the safe, caring and God-honouring thing, when otherwise we might not have.</p>
<p class="wp-block-paragraph">But after twenty-eight years, the denomination should be able to demonstrate — publicly — that the training actually changes the way leaders behave, not just that it has been delivered. I can say it has changed me, but I regular see places where it has not changed others. We can teach a leader what to do when something hard comes up in their congregation. We cannot, by teaching them, make them want to do it. You can lead a horse to water, as the saying goes. The training exists. The question of whether it has produced the heart and culture change it was designed to produce is a separate question, and I do not think we have asked it carefully enough.</p>
<p class="wp-block-paragraph">Whatever the Assembly decides about the overture, an independent external evaluation of Breaking the Silence needs to happen — and it needs to be public, and it needs to happen within a defined timeframe. The case for that evaluation does not depend on the overture. We should have started it years ago. Linking it to the overture now risks making it look conditional on the vote. It cannot be.</p>
<h2 class="wp-block-heading">On what comes next</h2>
<p class="wp-block-paragraph">The consultation closes on 8th June (extended). The Committee will summarise the responses and report to the Assembly. The Assembly will debate, and a decision will be made, and that decision will go to the presbyteries under the Barrier Act. Many people much more qualified than me will speak to that process from inside it.</p>
<p class="wp-block-paragraph">I want to say to people on every side of this debate: the conversation does not end on 25 May. It does not end when the Assembly votes. It does not end when the presbyteries ratify or reject. The work of being a denomination that treats women as co-image-bearers — which is what the rest of Scripture asks of us, whatever we make of the disputed passages — continues regardless of which way the vote goes.</p>
<p class="wp-block-paragraph">If the overture passes, the church will need to do urgent and serious work to build the structural protections that should have been built first. If the overture fails, the church will still need to do urgent and serious work to address the patterns that this whole process has surfaced — because I am sure that the responses being submitted right now, in whatever form they are summarised, will tell us that those patterns are real and that they will not be made better by leaving things as they are.</p>
<p class="wp-block-paragraph">I have heard, in the past few weeks, from people on every side of this. From women who said thank you. From women who were angry — sometimes at me, more often at the situation. From men who engaged carefully and seriously with what I wrote. From men in positions of authority in our denomination who responded to careful, prayerful arguments in public forums by adding a laughing emoji. I think one of these responses is more pastoral than the other.</p>
<p class="wp-block-paragraph">This has not been an easy season. I have written more than I expected to. I have listened to more than I expected to. I have sat with more than I expected to. I am thankful that my own local church has been willing to sit with this seriously, and I am aware — painfully — that not everyone reading this has had the same experience in their own congregation.</p>
<p class="wp-block-paragraph">To those who have not — I am sorry. The fact that you took the time to respond to a survey from an institution that did not, in your experience, make space for you to be heard, is worth more than you may ever know.</p>
<p class="wp-block-paragraph">We are praying, all of us, for an outcome that honours both Scripture and the people Scripture describes. Whatever happens on 25 May and after, that prayer continues. So does the work.</p>
<p class="wp-block-paragraph">As always, the errors and judgements are mine.</p>
The PCNSW’s WHS consultation on the proposed change to restrict future eldership to men closes on 8th June (extended from the original 25 May). I have submitted my response. Many of you reading this also have — and many of you reached the survey, directly or indirectly, through something I wrote.
I want to use this space to do three things. To thank you. To say where I’ve landed. And to be honest about what this season has cost, both for me and for the people who have walked through it.
To those who engaged
If you responded to the survey because the companion guide helped you see why it mattered — or because someone you know read it and brought it to you — thank you. I am genuinely grateful. Many women in leadership roles across PCNSW churches told me directly that they only knew the survey existed because of something I wrote or shared. I know the actual reach has been much wider than that, because resources have a way of travelling further than their authors ever see. To everyone who engaged, in whatever form that took: thank you.
I want to acknowledge something specifically. Many of you responded at real cost. You have told me that you wrote things in the free-text fields that took a long time to put into words, and that were difficult to commit to a survey form, even a confidential one. Some of you took the risk of being honest about your own experience of leadership in our churches, knowing that the data would be read by people you don’t know, and presented to an Assembly you may not feel knows or sees you. That took courage, and I do not think I have to tell you that. I think you know.
Whatever the Committee does with the submissions, what you wrote matters. It will continue to matter after 25 May. You said the thing. The thing is now on the record. That is not a small thing.
Where I have landed
The companion guide was an attempt to set out the evidence fairly. I think it succeeded at that. But the survey response I submitted was something different — it was an attempt to say what I actually think. And I want to say some of that here, because anyone who took the guide seriously deserves to know where its author ended up.
I have landed in this place: we are already poorer as a denomination for how we treat women.
Not because of the proposed change, which has not yet been made. But because of patterns that already exist in our churches — patterns I have seen with my own eyes, and patterns I have heard about from people I trust enough to listen to. I have heard from women who have brought serious things to their sessions and been told to get over them. I have heard from women whose ministry pathways have narrowed rather than widened, not through any single decision, but through a steady pattern of opportunity going elsewhere. I have heard from women who have been told their opinions don’t matter, sometimes in words and more often in the structural shape of how a meeting was run. I have heard from women who have had unwanted pursuit by men in our churches and not heard well when they raised concerns about it. It is not my place to share any of those stories in detail, because they are not mine to share. But they are real, and there are more of them than I wanted to see. The cumulative weight of them is something I am still sitting with.
The academic literature describes more or less exactly what I have been hearing. The Royal Commission described it nearly a decade ago. Professor Tuckey’s research describes it. Our own PCA’s 2020 survey described it. We have not lacked the evidence. We have lacked the willingness to receive it.
The proposed change does not create these problems. It formalises them. That is the frame I would want people to hold as they wait for the Committee’s report. The question is not “what risk does this change create?” The question is “what is already happening in our churches, and does this change help or does it make things worse?”
My honest answer is that it makes them worse. I cannot see the mechanism by which closing the formal possibility of women on session improves women’s confidence to bring difficult things forward. I cannot see how it improves the quality of governance in sessions that are not currently well-governed — and as someone who sits on boards in business and not-for-profit contexts, I can tell you that most boards are not well-governed by default. They get there through the kind of scrutiny that comes from having voices in the room that ask the questions no one else will ask. I cannot see how the change improves our witness to people watching us from outside — and people are watching, more carefully than we sometimes notice.
The Reformed tradition has always known that authority without accountability is dangerous. The Reformers did not consolidate authority and design accountability afterwards. They built the accountability first. If this change proceeds — and it may — it has to be done in that order. From the public material I have seen, we are not yet doing it in that order. The conditions that would need to be in place — binding alternative participation mechanisms for women, independent evaluation of our safeguarding training, complaint pathways trusted by the people who would need them — are not yet there. We are being asked to make the structural change without the structural supports. That is not how the Reformers would have built it, and I do not think it is how we should be building it now.
I have had multiple conversations with my non-Christian friends and coworkers about the Church’s position on these, the debate and the overall way that their attitude to women is seen. I hope and pray I have been able to do some work to repair the perception they have of how our church loves the people who are part of it.
On Breaking the Silence
I want to be clear about one thing, because it has shaped how I have thought about all of this. Breaking the Silence is excellent. It has helped me understand power and abuse and the responsibilities of leadership; it has given me a framework to process things I have needed to process; and I think it is one of the best resources of its kind in the Christian church anywhere. I am one of its strongest advocates.
When I was assaulted last year, the framework I have – that started with the training Breaking the Silence gave me – was materially necessary in helping me understand the nature of what happened, my response to it, and the nature of abuse. It has helped me identify patterns and temptations in my own leadership and to materially change them; and to help make and shape decisions in areas where I am involved to choose to do the safe, caring and God-honouring thing, when otherwise we might not have.
But after twenty-eight years, the denomination should be able to demonstrate — publicly — that the training actually changes the way leaders behave, not just that it has been delivered. I can say it has changed me, but I regular see places where it has not changed others. We can teach a leader what to do when something hard comes up in their congregation. We cannot, by teaching them, make them want to do it. You can lead a horse to water, as the saying goes. The training exists. The question of whether it has produced the heart and culture change it was designed to produce is a separate question, and I do not think we have asked it carefully enough.
Whatever the Assembly decides about the overture, an independent external evaluation of Breaking the Silence needs to happen — and it needs to be public, and it needs to happen within a defined timeframe. The case for that evaluation does not depend on the overture. We should have started it years ago. Linking it to the overture now risks making it look conditional on the vote. It cannot be.
On what comes next
The consultation closes on 8th June (extended). The Committee will summarise the responses and report to the Assembly. The Assembly will debate, and a decision will be made, and that decision will go to the presbyteries under the Barrier Act. Many people much more qualified than me will speak to that process from inside it.
I want to say to people on every side of this debate: the conversation does not end on 25 May. It does not end when the Assembly votes. It does not end when the presbyteries ratify or reject. The work of being a denomination that treats women as co-image-bearers — which is what the rest of Scripture asks of us, whatever we make of the disputed passages — continues regardless of which way the vote goes.
If the overture passes, the church will need to do urgent and serious work to build the structural protections that should have been built first. If the overture fails, the church will still need to do urgent and serious work to address the patterns that this whole process has surfaced — because I am sure that the responses being submitted right now, in whatever form they are summarised, will tell us that those patterns are real and that they will not be made better by leaving things as they are.
I have heard, in the past few weeks, from people on every side of this. From women who said thank you. From women who were angry — sometimes at me, more often at the situation. From men who engaged carefully and seriously with what I wrote. From men in positions of authority in our denomination who responded to careful, prayerful arguments in public forums by adding a laughing emoji. I think one of these responses is more pastoral than the other.
This has not been an easy season. I have written more than I expected to. I have listened to more than I expected to. I have sat with more than I expected to. I am thankful that my own local church has been willing to sit with this seriously, and I am aware — painfully — that not everyone reading this has had the same experience in their own congregation.
To those who have not — I am sorry. The fact that you took the time to respond to a survey from an institution that did not, in your experience, make space for you to be heard, is worth more than you may ever know.
We are praying, all of us, for an outcome that honours both Scripture and the people Scripture describes. Whatever happens on 25 May and after, that prayer continues. So does the work.
<p class="wp-block-paragraph">I had the privilege of praying in my Church this morning for our Mothers, Mothers Day, our Church and the World.</p>
<div style="background-image:linear-gradient(180deg,rgb(238,238,238) 0%,rgb(169,184,195) 100%);" class="wp-block-group has-background"><div class="wp-block-group__inner-container is-layout-constrained wp-block-group-is-layout-constrained">
<p class="wp-block-paragraph">Heavenly Father,</p>
<p class="wp-block-paragraph">We come before you this morning, knowing that today is a day that carries many things at once — joy and sorrow, gratitude and grief, satisfaction and longing — and we thank you that you are a God large enough to hold them all, and tender enough to hold each of us within them.</p>
<p class="wp-block-paragraph">For many of us, today is a day of celebration. Our Mothers are honoured, our children are gathered close, and love is present and warm and near. I pray this morning as one of those for whom the day is full — as a father, with a family who love Jesus, and I do not take that for granted for a moment. I know what a gift it is — and I know it is a gift, and not a given.</p>
<p class="wp-block-paragraph">We thank you, Father, for the mothers among us — for their patience that outlasts ours, for their sacrifice that we so often only understand long after it has been made, for the fierce and quiet love that has shaped so many of us into who we now are. We celebrate them today, and we are grateful, and we ask that you would bless them.</p>
<p class="wp-block-paragraph">But Father, we know that for others among us, this day is harder to inhabit. There are those in this room who carry grief that this morning will press against — whose mothers have gone, whose relationships are fractured or strained or impossibly far away. There are those who longed to be mothers and are not. There are mothers among us who have lost children — who know an ache that this day will reopen, and a name they will carry quietly through the service this morning as they do every day.</p>
<p class="wp-block-paragraph">There are those who long to be close and find themselves distant, and those who are distant and do not yet know how to come home. We lift them all to you. Be near to the broken-hearted, Lord, as you have promised to be. Let this place — and this hour — be a refuge wide enough for their sorrow as well as our celebration, because both belong here, and both belong to you.</p>
<p class="wp-block-paragraph">We pray too for those whose mothers have gone to glory, and we thank you for the unshakable hope we have in Jesus — that death is not the end, that the grave does not have the final word, and that we will see them again. And Father, we hold before you something heavier still: those whose mothers, or whose loved ones, died without us knowing whether they had come to a saving faith in your Son. That is a weight we cannot carry, and we do not pretend to. We trust your justice, Lord, because it is perfect. We trust your mercy, because it is wider than ours. And we leave them in the hands of the only one who knows them truly and loves them better than we ever could.</p>
<p class="wp-block-paragraph">Father, we pray now for our own church. We pray for growth — and we ask for it on every level. We pray for new families to find their home among us, for new faces to become familiar ones, for our community to deepen and widen at once. We thank you for the way you have already answered this prayer over the last 12 months, and we know you will continue to do so.</p>
<p class="wp-block-paragraph">But more than that, Lord, we pray for new people to come to a saving faith in your Son. Numbers without faith is not the growth we are asking for. Grow us in depth as well as in breadth. Do the slow and patient work of sanctification in us. Edify us through your word and by your Spirit. Let the gospel do its work in our own lives first, before we presume to take it anywhere else.</p>
<p class="wp-block-paragraph">We pray for our kids ministry — for the leaders who give their Sundays to it, for the children who are being shaped by it, and for seeds being planted now that may not bear fruit for decades but will bear fruit, by your grace. We thank you for our kids club and the work faithfully done by the Bells and the Siu’s to make it happen, and the support they have been given from people all around our church family. We pray for the teachers who faithfully give their energy and time every week during term to help lead, grow, disciple and mentor our children during Kids Church.</p>
<p class="wp-block-paragraph">We pray for our new planned ESL class, and the open door it represents to our community. Let it be more than a service, Lord — let it be a place of genuine welcome, of friendship, of dignity, and in your timing, a place where people encounter the love of Christ in word and not only in deed. And we pray for those who are coming to Christianity Explained — for hearts that are softening, for questions that are honest, for eyes that are beginning to open. Bring some of them, Lord, before it ends, into a real and saving knowledge of Jesus.</p>
<p class="wp-block-paragraph">We pray for our denomination, Father, as it wrestles in this season with difficult questions on many fronts. Give it wisdom — careful, humble, biblical wisdom — as it thinks through the role of elder, and what faithfulness to your word looks like in the life of the church. And give it the same wisdom as it looks for new and faithful ways to include women in the decision-making and the governance of your church. Hold it together where it is tempted to fracture. Guard it from the pride that grasps and the cowardice that retreats. Lead it, Lord, into the truth — because the truth is yours to give, and yours alone.</p>
<p class="wp-block-paragraph">And Father, we lift our eyes to the world. We pray urgently — and we mean urgently — for peace.</p>
<p class="wp-block-paragraph">We pray for the Middle East, Lord, where war has come to too many places at once. We pray for Israel and Palestine, where bloodshed has not stopped and grief has not been allowed to settle. We pray for the United States and Iran, for an end to the violence between them, and that it would not widen further than it already has. We pray on every side of every line — for the protection of the vulnerable, for the sparing of children, for the displaced and the frightened, for comfort to those who have already lost more than we can imagine. And Father, we pray for those among us this morning who carry these places personally — who have family caught in conflicts they cannot reach across, who watch the news with a weight the rest of us cannot fully share. Be near to them, Lord. Hold their loved ones in your hands. And bring peace — real peace, your peace — to a region that has known far too little of it for far too long.</p>
<p class="wp-block-paragraph">We pray for Russia and Ukraine, for an end to a war that has gone on too long and cost too much, for justice that does not abandon mercy, and for mercy that does not abandon justice. Lord, have mercy on this world. Lord, have mercy.</p>
<p class="wp-block-paragraph">We pray for the persecuted church — for our brothers and sisters who bear the name of Jesus at a cost we have never had to pay. Strengthen them, sustain them, and let them know that across the world they are remembered, and they are loved, and they are not alone.</p>
<p class="wp-block-paragraph">And we pray, Father, for the going out of the gospel — to the ends of our street, and to the ends of the earth, and to every place between. From our kids ministry to our ESL class, from Christianity Explained to the nations, from this church on this morning to wherever your Spirit will send us next. Let the thread of your good news run through everything we are and everything we do. We have nothing better to offer this world than Jesus, and we have nothing less than Jesus to offer.</p>
<p class="wp-block-paragraph">All of this we bring to you in the name of your Son — in whom all things, our joy and our grief, our church and our world, our beginnings and our endings, hold together.</p>
<p class="wp-block-paragraph">Amen.</p>
</div></div>
I had the privilege of praying in my Church this morning for our Mothers, Mothers Day, our Church and the World.
Heavenly Father,
We come before you this morning, knowing that today is a day that carries many things at once — joy and sorrow, gratitude and grief, satisfaction and longing — and we thank you that you are a God large enough to hold them all, and tender enough to hold each of us within them.
For many of us, today is a day of celebration. Our Mothers are honoured, our children are gathered close, and love is present and warm and near. I pray this morning as one of those for whom the day is full — as a father, with a family who love Jesus, and I do not take that for granted for a moment. I know what a gift it is — and I know it is a gift, and not a given.
We thank you, Father, for the mothers among us — for their patience that outlasts ours, for their sacrifice that we so often only understand long after it has been made, for the fierce and quiet love that has shaped so many of us into who we now are. We celebrate them today, and we are grateful, and we ask that you would bless them.
But Father, we know that for others among us, this day is harder to inhabit. There are those in this room who carry grief that this morning will press against — whose mothers have gone, whose relationships are fractured or strained or impossibly far away. There are those who longed to be mothers and are not. There are mothers among us who have lost children — who know an ache that this day will reopen, and a name they will carry quietly through the service this morning as they do every day.
There are those who long to be close and find themselves distant, and those who are distant and do not yet know how to come home. We lift them all to you. Be near to the broken-hearted, Lord, as you have promised to be. Let this place — and this hour — be a refuge wide enough for their sorrow as well as our celebration, because both belong here, and both belong to you.
We pray too for those whose mothers have gone to glory, and we thank you for the unshakable hope we have in Jesus — that death is not the end, that the grave does not have the final word, and that we will see them again. And Father, we hold before you something heavier still: those whose mothers, or whose loved ones, died without us knowing whether they had come to a saving faith in your Son. That is a weight we cannot carry, and we do not pretend to. We trust your justice, Lord, because it is perfect. We trust your mercy, because it is wider than ours. And we leave them in the hands of the only one who knows them truly and loves them better than we ever could.
Father, we pray now for our own church. We pray for growth — and we ask for it on every level. We pray for new families to find their home among us, for new faces to become familiar ones, for our community to deepen and widen at once. We thank you for the way you have already answered this prayer over the last 12 months, and we know you will continue to do so.
But more than that, Lord, we pray for new people to come to a saving faith in your Son. Numbers without faith is not the growth we are asking for. Grow us in depth as well as in breadth. Do the slow and patient work of sanctification in us. Edify us through your word and by your Spirit. Let the gospel do its work in our own lives first, before we presume to take it anywhere else.
We pray for our kids ministry — for the leaders who give their Sundays to it, for the children who are being shaped by it, and for seeds being planted now that may not bear fruit for decades but will bear fruit, by your grace. We thank you for our kids club and the work faithfully done by the Bells and the Siu’s to make it happen, and the support they have been given from people all around our church family. We pray for the teachers who faithfully give their energy and time every week during term to help lead, grow, disciple and mentor our children during Kids Church.
We pray for our new planned ESL class, and the open door it represents to our community. Let it be more than a service, Lord — let it be a place of genuine welcome, of friendship, of dignity, and in your timing, a place where people encounter the love of Christ in word and not only in deed. And we pray for those who are coming to Christianity Explained — for hearts that are softening, for questions that are honest, for eyes that are beginning to open. Bring some of them, Lord, before it ends, into a real and saving knowledge of Jesus.
We pray for our denomination, Father, as it wrestles in this season with difficult questions on many fronts. Give it wisdom — careful, humble, biblical wisdom — as it thinks through the role of elder, and what faithfulness to your word looks like in the life of the church. And give it the same wisdom as it looks for new and faithful ways to include women in the decision-making and the governance of your church. Hold it together where it is tempted to fracture. Guard it from the pride that grasps and the cowardice that retreats. Lead it, Lord, into the truth — because the truth is yours to give, and yours alone.
And Father, we lift our eyes to the world. We pray urgently — and we mean urgently — for peace.
We pray for the Middle East, Lord, where war has come to too many places at once. We pray for Israel and Palestine, where bloodshed has not stopped and grief has not been allowed to settle. We pray for the United States and Iran, for an end to the violence between them, and that it would not widen further than it already has. We pray on every side of every line — for the protection of the vulnerable, for the sparing of children, for the displaced and the frightened, for comfort to those who have already lost more than we can imagine. And Father, we pray for those among us this morning who carry these places personally — who have family caught in conflicts they cannot reach across, who watch the news with a weight the rest of us cannot fully share. Be near to them, Lord. Hold their loved ones in your hands. And bring peace — real peace, your peace — to a region that has known far too little of it for far too long.
We pray for Russia and Ukraine, for an end to a war that has gone on too long and cost too much, for justice that does not abandon mercy, and for mercy that does not abandon justice. Lord, have mercy on this world. Lord, have mercy.
We pray for the persecuted church — for our brothers and sisters who bear the name of Jesus at a cost we have never had to pay. Strengthen them, sustain them, and let them know that across the world they are remembered, and they are loved, and they are not alone.
And we pray, Father, for the going out of the gospel — to the ends of our street, and to the ends of the earth, and to every place between. From our kids ministry to our ESL class, from Christianity Explained to the nations, from this church on this morning to wherever your Spirit will send us next. Let the thread of your good news run through everything we are and everything we do. We have nothing better to offer this world than Jesus, and we have nothing less than Jesus to offer.
All of this we bring to you in the name of your Son — in whom all things, our joy and our grief, our church and our world, our beginnings and our endings, hold together.
I used to be able to spot an LLM hallucination a mile away. The writing had a whiff about it — too confident, slightly off-topic, a citation that didn’t quite exist. You’d read it, squint, and go: <em>no, that’s not right</em>.
That job is getting harder. Not because the models have stopped making things up. They haven’t. The problem is that they’ve got much better at making things up <em>plausibly</em>.
<h3>Why this is a problem we can’t engineer away</h3>
Every large language model — Claude, ChatGPT, Gemini, Copilot, the lot — is at its core a probability engine. It predicts the “next token” based on everything that came before. That’s a remarkable trick when the training data is rich and the question is well-posed. It’s less useful when the model has to reach for something outside its training or retrieval context, because the same machinery that produces a correct answer also produces a confident-sounding wrong one. The models can represent uncertainty internally — token probabilities give you something to work with — but there’s no reliable mechanism that surfaces “I don’t know” to the user. It’s a calibration and training-incentive problem, not a capability one.
This isn’t just my view. In September 2025, OpenAI’s own researchers <a href="https://arxiv.org/abs/2509.04664">published a paper arguing that hallucinations can be understood, in part, as errors in binary classification</a> — a predictable consequence of how models are trained and how we evaluate them. Accuracy-only leaderboards, they point out, reward confident guessing over saying “I don’t know.” Other researchers have gone further: one group <a href="https://arxiv.org/abs/2401.11817">showed, under formal assumptions about computability and training data, that hallucination is unavoidable</a> in any computable LLM, while <a href="https://arxiv.org/abs/2409.05746">another used Gödel-style reasoning</a> to argue that hallucination risk persists across every stage of the LLM pipeline. Even an earlier <a href="https://arxiv.org/abs/2311.14648">STOC 2024 paper by Kalai and Vempala</a> suggested that rare facts in training data place a lower bound on the hallucination rate of any well-calibrated model. These are theoretical results with real assumptions behind them, but the direction is consistent: hallucination isn’t going to be patched out.
Reasoning models — the frontier stuff from the last eighteen months — help, but they don’t solve it. What they do is take more turns: they draft, critique, redraft, search, reconsider. Given enough compute and enough tool access (web search, code execution, document retrieval), a reasoning model can catch its own errors. Given a single quick turn on a question outside its training data, it often won’t.
And there’s a twist. <a href="https://cdn.openai.com/pdf/2221c875-02dc-4789-800b-e7758f3722c1/o3-and-o4-mini-system-card.pdf">OpenAI’s own system card for o3 and o4-mini</a>, published in April 2025, reported that o3 hallucinates around 33% of the time on the PersonQA benchmark — a difficult factual-recall task — and o4-mini around 48%, roughly double the rate of their o1 and o3-mini predecessors. That’s one benchmark, not a universal regression, but <a href="https://www.nytimes.com/2025/05/05/technology/ai-hallucinations-chatgpt-google.html">the <em>New York Times</em> covered the trend in May 2025</a>: on several benchmarks, hallucination rates in newer reasoning models have gone up, not down. Models are tackling harder tasks and attempting more claims; the absolute number of confident-but-wrong statements rises along with the confident-and-right ones. So the ability to <em>sound right</em> is increasing faster than the ability to <em>be right</em>. That’s the problem.
<h3>What a hallucination actually looks like now</h3>
Two years ago an obvious hallucination might have been a wrong date, a fabricated quote, a made-up case law reference. You could fact-check it in thirty seconds.
Today’s failures are subtler. A Python script that uses a function with the right name but wrong arguments. A network configuration that’s valid syntax but wrong for your vendor’s firmware version. A summary of a document that captures the shape of the argument but inverts one crucial claim. A legal clause that reads professionally but cites the wrong Act. The shell is convincing. The filling isn’t.
I’ve seen this in my own work, more frequently than I did even a year ago, and colleagues have noticed the same thing. An LLM recently drafted a paragraph for a colleague about a piece of telecoms hardware, confidently asserting that it was “the same ATA our CloudPBX team already ships into office deployments.” We don’t ship it. The model had never been told we did. It reads as the kind of thing a helpful internal colleague would write — specific, plausible, in the right voice — which is exactly what makes it dangerous. In another session, pushed on where a particular claim had come from, the model simply admitted it had made the line up: it had extrapolated from a third-party review and the general reputation of a similar product, then written it in first-person company voice because it sounded convincing. That’s an unusually honest self-report and worth taking seriously. It’s the machinery describing itself: <em>I wrote what sounded right</em>.
This is where the structural point from the last section stops being abstract. It’s also something people often miss: even when you give an LLM good source material to work from, it doesn’t just quote you back. It summarises, re-references, and reformulates internally — and things get distorted in that process. The research community calls this <em>faithfulness</em> hallucination, to distinguish it from the making-stuff-up variety. <a href="https://dl.acm.org/doi/10.1145/3703155">The canonical survey by Huang et al.</a> describes it as “context inconsistency” — cases where the model “ignores or alters important facts within the original text.” A paper from <a href="https://arxiv.org/abs/2505.15291">January 2026 found a tendency for faithfulness to degrade toward the <em>end</em> of longer responses</a> — the model drifts from the source the further it gets into its own output. So that long, fluent, well-cited summary of your policy document? The bit at the bottom is, on the evidence, the most likely place for something to be subtly wrong.
<a href="https://www.oneusefulthing.org/p/15-times-to-use-ai-and-5-not-to">Ethan Mollick has made a related point</a>: because LLM errors are, by construction, plausible, users “fall asleep at the wheel.” And it’s not a new concern — a <a href="https://www.science.org/doi/10.1126/sciadv.adh1850">2023 <em>Science Advances</em> study</a> found that in controlled conditions, participants were more likely to believe AI-generated disinformation than the human-written equivalent. The prose was simply better. That was GPT-3. The gap has only widened.
This is not a failure mode anyone can detect by vibe-checking the output.
<h3>What actually works</h3>
There’s a temptation to treat this as a novel problem requiring novel tools. It isn’t. Academia has been dealing with confident-sounding-but-wrong writing for about four hundred years, and the mechanism we landed on is peer review. Someone else reads your work and tries to break it.
That’s exactly what works with AI output too. A few patterns I rely on:
<strong>Ask the same model to critique its own answer, in a fresh session.</strong> Not “check your work” — that rarely does much. Instead: “Here’s a proposed solution to X. What’s wrong with it? What assumptions is it making? What would cause this to fail?” The fresh session matters, because the model isn’t anchored to defending what it just wrote. This is the idea behind <a href="https://arxiv.org/abs/2303.17651">Self-Refine</a>, a NeurIPS 2023 paper that reported improvements of around 20% on several tasks from iterative self-critique, without any additional training. There’s a catch, though: <a href="https://arxiv.org/abs/2402.08115">a 2024 paper from Stechly, Valmeekam and Kambhampati</a> found that pure self-critique can actually make reasoning performance <em>worse</em> on some tasks — the model talks itself into the wrong answer. Which leads to the next pattern.
<strong>Cross-examine with a different model.</strong> If I’ve had Claude draft a tricky piece of network config, I’ll often paste it into ChatGPT or Gemini and ask what it would do differently and why. The disagreements are where the interesting errors live. The models have different training data, different reasoning styles, different blind spots. Where they all agree, my confidence goes up — but it’s not proof. Frontier models are trained on heavily overlapping corpora and can confidently agree on the same wrong answer, especially on topics where the public internet is itself wrong. So agreement is a signal, not a guarantee; disagreement is what tells me I need to go and read the documentation myself. This pattern is exactly the approach taken in <a href="https://arxiv.org/abs/2305.14325">Du et al.’s “Multiagent Debate” paper at ICML 2024</a>, which found that having multiple LLM instances debate an answer over several rounds measurably reduces hallucination. Google DeepMind’s <a href="https://deepmind.google/blog/facts-grounding-a-new-benchmark-for-evaluating-the-factuality-of-large-language-models/">FACTS Grounding benchmark</a> operationalises the same idea at institutional scale, using an ensemble of Gemini, GPT-4o and Claude to judge factuality.
<strong>Make the model do the research.</strong> For anything involving facts that change — pricing, versions, API behaviour, legislation — don’t trust the weights. Make the model search, cite the source, and then check the source actually says what it claims. Retrieval doesn’t eliminate hallucination — an <a href="https://aclanthology.org/2023.emnlp-main.398/">EMNLP 2023 benchmark called ALCE</a> found that even the best systems miss full citation support about 50% of the time, and as noted above, even with a good source document in hand, models can drift from what it actually says — but it massively reduces fabrication when combined with a human checking the citations. Vendors are starting to build this in: <a href="https://simonwillison.net/2025/Jan/24/anthropics-new-citations-api/">Anthropic’s Citations API</a>, launched in January 2025, returns direct quotes from supplied source documents, and at least one customer reported reducing “source confabulations from 10% to zero.”
<strong>Build the friction in before you need it.</strong> The cheapest time to catch a hallucination is before you’ve acted on it. That means reviewing AI output <em>before</em> you paste it into the production config, send it to the client, or quote it in the board paper. <a href="https://simonwillison.net/2025/Mar/2/hallucinations-in-code/">Simon Willison puts it well</a>: hallucinated code is actually the <em>least</em> dangerous failure mode, because the compiler tells you immediately. It’s the hallucinations that read clean and pass initial review that hurt. Treat the first draft as a draft.
<h3>What this means for how we use AI at work</h3>
At RWTS we do a lot of work where the cost of a confident wrong answer is high — network changes, security configurations, compliance documents. The teams who get good results with AI aren’t the ones who’ve bought the most expensive model. They’re the ones who’ve built review into the workflow. A senior engineer reviews the AI-drafted config. A peer reads the AI-drafted proposal before it goes out. The AI is a junior colleague with a photographic memory and occasional blind spots, not an oracle.
That framing isn’t mine — <a href="https://www.oneusefulthing.org/p/on-boarding-your-ai-intern">Ethan Mollick has been arguing for years</a> that LLMs are best thought of as “weird, somewhat alien interns that work infinitely fast and sometimes lie to make you happy.” <a href="https://simonwillison.net/2025/Oct/7/vibe-engineering/">Simon Willison goes further</a> and calls them “a growing army of weird digital interns who will absolutely cheat if you give them a chance.” The operational answer, in both cases, is the same: tests, specs, code review, and a human in the loop who actually knows the domain.
That’s the one I’d offer to anyone trying to work out how much to trust these tools. You wouldn’t let a talented graduate push code to production unreviewed on day one. You wouldn’t let them send a client a statement of work without someone else reading it. The same rules apply.
Hallucinations aren’t going away. They’re a property of how these systems work, not a bug that’ll be patched out in the next release. The models will keep getting better at sounding right. Our job is to make sure the review process keeps up.
<hr />
<em>A note on process: I drafted this post with Claude’s help, then had a different model review it for overstated claims and weak citations. Several paragraphs got tightened as a result. Which is, roughly, the point.</em>
<hr />
<em>I’m the Director and CTO over at <a href="https://rwts.com.au">Real World Technology Solutions</a>. RWTS helps organisations get real value from AI without betting the business on it. Call us on 1300 798 718.</em>
I used to be able to spot an LLM hallucination a mile away. The writing had a whiff about it — too confident, slightly off-topic, a citation that didn’t quite exist. You’d read it, squint, and go: no, that’s not right.
That job is getting harder. Not because the models have stopped making things up. They haven’t. The problem is that they’ve got much better at making things up plausibly.
Why this is a problem we can’t engineer away
Every large language model — Claude, ChatGPT, Gemini, Copilot, the lot — is at its core a probability engine. It predicts the “next token” based on everything that came before. That’s a remarkable trick when the training data is rich and the question is well-posed. It’s less useful when the model has to reach for something outside its training or retrieval context, because the same machinery that produces a correct answer also produces a confident-sounding wrong one. The models can represent uncertainty internally — token probabilities give you something to work with — but there’s no reliable mechanism that surfaces “I don’t know” to the user. It’s a calibration and training-incentive problem, not a capability one.
Reasoning models — the frontier stuff from the last eighteen months — help, but they don’t solve it. What they do is take more turns: they draft, critique, redraft, search, reconsider. Given enough compute and enough tool access (web search, code execution, document retrieval), a reasoning model can catch its own errors. Given a single quick turn on a question outside its training data, it often won’t.
And there’s a twist. OpenAI’s own system card for o3 and o4-mini, published in April 2025, reported that o3 hallucinates around 33% of the time on the PersonQA benchmark — a difficult factual-recall task — and o4-mini around 48%, roughly double the rate of their o1 and o3-mini predecessors. That’s one benchmark, not a universal regression, but the New York Times covered the trend in May 2025: on several benchmarks, hallucination rates in newer reasoning models have gone up, not down. Models are tackling harder tasks and attempting more claims; the absolute number of confident-but-wrong statements rises along with the confident-and-right ones. So the ability to sound right is increasing faster than the ability to be right. That’s the problem.
What a hallucination actually looks like now
Two years ago an obvious hallucination might have been a wrong date, a fabricated quote, a made-up case law reference. You could fact-check it in thirty seconds.
Today’s failures are subtler. A Python script that uses a function with the right name but wrong arguments. A network configuration that’s valid syntax but wrong for your vendor’s firmware version. A summary of a document that captures the shape of the argument but inverts one crucial claim. A legal clause that reads professionally but cites the wrong Act. The shell is convincing. The filling isn’t.
I’ve seen this in my own work, more frequently than I did even a year ago, and colleagues have noticed the same thing. An LLM recently drafted a paragraph for a colleague about a piece of telecoms hardware, confidently asserting that it was “the same ATA our CloudPBX team already ships into office deployments.” We don’t ship it. The model had never been told we did. It reads as the kind of thing a helpful internal colleague would write — specific, plausible, in the right voice — which is exactly what makes it dangerous. In another session, pushed on where a particular claim had come from, the model simply admitted it had made the line up: it had extrapolated from a third-party review and the general reputation of a similar product, then written it in first-person company voice because it sounded convincing. That’s an unusually honest self-report and worth taking seriously. It’s the machinery describing itself: I wrote what sounded right.
This is where the structural point from the last section stops being abstract. It’s also something people often miss: even when you give an LLM good source material to work from, it doesn’t just quote you back. It summarises, re-references, and reformulates internally — and things get distorted in that process. The research community calls this faithfulness hallucination, to distinguish it from the making-stuff-up variety. The canonical survey by Huang et al. describes it as “context inconsistency” — cases where the model “ignores or alters important facts within the original text.” A paper from January 2026 found a tendency for faithfulness to degrade toward the end of longer responses — the model drifts from the source the further it gets into its own output. So that long, fluent, well-cited summary of your policy document? The bit at the bottom is, on the evidence, the most likely place for something to be subtly wrong.
Ethan Mollick has made a related point: because LLM errors are, by construction, plausible, users “fall asleep at the wheel.” And it’s not a new concern — a 2023 Science Advances study found that in controlled conditions, participants were more likely to believe AI-generated disinformation than the human-written equivalent. The prose was simply better. That was GPT-3. The gap has only widened.
This is not a failure mode anyone can detect by vibe-checking the output.
What actually works
There’s a temptation to treat this as a novel problem requiring novel tools. It isn’t. Academia has been dealing with confident-sounding-but-wrong writing for about four hundred years, and the mechanism we landed on is peer review. Someone else reads your work and tries to break it.
That’s exactly what works with AI output too. A few patterns I rely on:
Ask the same model to critique its own answer, in a fresh session. Not “check your work” — that rarely does much. Instead: “Here’s a proposed solution to X. What’s wrong with it? What assumptions is it making? What would cause this to fail?” The fresh session matters, because the model isn’t anchored to defending what it just wrote. This is the idea behind Self-Refine, a NeurIPS 2023 paper that reported improvements of around 20% on several tasks from iterative self-critique, without any additional training. There’s a catch, though: a 2024 paper from Stechly, Valmeekam and Kambhampati found that pure self-critique can actually make reasoning performance worse on some tasks — the model talks itself into the wrong answer. Which leads to the next pattern.
Cross-examine with a different model. If I’ve had Claude draft a tricky piece of network config, I’ll often paste it into ChatGPT or Gemini and ask what it would do differently and why. The disagreements are where the interesting errors live. The models have different training data, different reasoning styles, different blind spots. Where they all agree, my confidence goes up — but it’s not proof. Frontier models are trained on heavily overlapping corpora and can confidently agree on the same wrong answer, especially on topics where the public internet is itself wrong. So agreement is a signal, not a guarantee; disagreement is what tells me I need to go and read the documentation myself. This pattern is exactly the approach taken in Du et al.’s “Multiagent Debate” paper at ICML 2024, which found that having multiple LLM instances debate an answer over several rounds measurably reduces hallucination. Google DeepMind’s FACTS Grounding benchmark operationalises the same idea at institutional scale, using an ensemble of Gemini, GPT-4o and Claude to judge factuality.
Make the model do the research. For anything involving facts that change — pricing, versions, API behaviour, legislation — don’t trust the weights. Make the model search, cite the source, and then check the source actually says what it claims. Retrieval doesn’t eliminate hallucination — an EMNLP 2023 benchmark called ALCE found that even the best systems miss full citation support about 50% of the time, and as noted above, even with a good source document in hand, models can drift from what it actually says — but it massively reduces fabrication when combined with a human checking the citations. Vendors are starting to build this in: Anthropic’s Citations API, launched in January 2025, returns direct quotes from supplied source documents, and at least one customer reported reducing “source confabulations from 10% to zero.”
Build the friction in before you need it. The cheapest time to catch a hallucination is before you’ve acted on it. That means reviewing AI output before you paste it into the production config, send it to the client, or quote it in the board paper. Simon Willison puts it well: hallucinated code is actually the least dangerous failure mode, because the compiler tells you immediately. It’s the hallucinations that read clean and pass initial review that hurt. Treat the first draft as a draft.
What this means for how we use AI at work
At RWTS we do a lot of work where the cost of a confident wrong answer is high — network changes, security configurations, compliance documents. The teams who get good results with AI aren’t the ones who’ve bought the most expensive model. They’re the ones who’ve built review into the workflow. A senior engineer reviews the AI-drafted config. A peer reads the AI-drafted proposal before it goes out. The AI is a junior colleague with a photographic memory and occasional blind spots, not an oracle.
That framing isn’t mine — Ethan Mollick has been arguing for years that LLMs are best thought of as “weird, somewhat alien interns that work infinitely fast and sometimes lie to make you happy.” Simon Willison goes further and calls them “a growing army of weird digital interns who will absolutely cheat if you give them a chance.” The operational answer, in both cases, is the same: tests, specs, code review, and a human in the loop who actually knows the domain.
That’s the one I’d offer to anyone trying to work out how much to trust these tools. You wouldn’t let a talented graduate push code to production unreviewed on day one. You wouldn’t let them send a client a statement of work without someone else reading it. The same rules apply.
Hallucinations aren’t going away. They’re a property of how these systems work, not a bug that’ll be patched out in the next release. The models will keep getting better at sounding right. Our job is to make sure the review process keeps up.
A note on process: I drafted this post with Claude’s help, then had a different model review it for overstated claims and weak citations. Several paragraphs got tightened as a result. Which is, roughly, the point.
I’m the Director and CTO over at Real World Technology Solutions. RWTS helps organisations get real value from AI without betting the business on it. Call us on 1300 798 718.
The consultation is being run as a Work Health and Safety process, with a survey that closes on 25 May. If you’ve been sent the link and weren’t sure how to engage with it, this post is for you.
I respond to legislative and regulatory consultations regularly, and I sit on committees that do the same. When I read a consultation paper, I need to understand what I’m being asked and why. What does the evidence say? What are the risks? What are the alternatives?
The PCNSW survey asks important questions, but it doesn’t provide any explanatory context tohelp you answer the questions well. So I went looking for that context. This is what I found.
A few things worth saying upfront.
This isn’t an attempt to argue the theological question of whether Scripture requires male-only eldership. That conversation is being had elsewhere, by people more qualified than me to have it. What this <em>does</em> ask is whether the process, the safeguards, and the evidence are adequate for a change of this magnitude. Those are governance questions, not theological ones, and they deserve honest engagement with the evidence we have. It’s also not a committee document. I’m a member and lay leader at my own church. I don’t have standing in the Assembly, and this hasn’t been produced in consultation with the WHS Consultative Committee or any denominational body.
What I do bring is experience in governance across business, not-for-profit, and community organisations, and time spent working on child safety policy. That work has shaped how I think about authority, accountability, and what happens when institutions get the structures wrong.
One thing readers should know: the evidence on the questions this survey asks runs predominantly in one direction — toward caution about the proposed change. That’s not because I’ve been selective. It’s because the evidence we’d need to feel confident about this isn’t there.
<ul>
<li>No public systematic evaluation has been identified showing what happened after any denomination restricted or removed women from governance.</li>
<li>No baseline assessment, no transition measurement, no outcome evaluation has been published for any Australian state church that moved to male-only eldership.</li>
<li>No binding structural safeguards specifically designed for all-male governance have been identified in any complementarian denomination, anywhere.</li>
<li>The alternative participation mechanisms that exist — advisory committees, women’s ministry groups, “co-worker” roles — don’t appear to have been independently evaluated.</li>
</ul>That evidence gap is itself worth weighing. It is usual to see that institutions that are confident the change had worked well have measured it.
Where things get more complicated is in our own context. Most PCNSW congregations don’t currently have women serving as elders. In many churches, sessions have been all-male for years or always. The Code permits women to serve as ruling elders — and has done since the 1967 GAA — but most churches have not exercised that permission.
For some congregations, the proposed change would formalise what is already the case. For the small number of churches that do have women elders, current women elders would continue to serve in that role, but no new women could be elected. The door closes permanently for everyone.
There’s a real difference, though, between a church that has <em>chosen</em> its current arrangement and a church that is <em>prohibited</em> from doing otherwise. The current Code preserves the freedom of each congregation to discern, under the guidance of Scripture and the Spirit. The proposed change removes that freedom denomination-wide. Even for churches that haven’t exercised that freedom, that’s a different thing.
What follows is structured around the survey itself. Each section gives you some evidence — research, the Royal Commission, the experience of comparable denominations, our own data — and a set of reflective questions designed to help you think through your own answer.
A note before you start: this covers a lot of ground. You don’t need to read it all at once, and you don’t need to do the survey straight away. Read a section. Sit with it. Talk about it with someone — a friend, your home group, your spouse, your minister. The questions are sharper when you’ve heard how someone else answers them. If your session is willing to host a conversation, this could serve as a starting point.
<em>With thanks to Valerie Ling (Centre for Effective Serving), a registered clinical psychologist and supervisor, for sharing her clinical and doctoral research into psychosocial safety, governance, and gender in church contexts which helped inform the framing of the WHS evidence here.</em>
<hr>
<h3>A note on how to read this</h3>We’re a Reformed church. We believe Scripture is the supreme authority for faith and practice. We also believe, as the Westminster Confession teaches, that God has given us reason, conscience, and the experience of the church across the ages as gifts for understanding how to apply Scripture wisely. The Reformers built Presbyterian governance on the conviction that all human authority is fallible and must be accountable. They insisted on plurality of elders, parity between ministers and ruling elders, and the right of appeal to higher courts — precisely because they took total depravity seriously as a reality that shapes how institutions must be designed.
The questions in this survey are governance questions. They deserve both spiritual discernment and honest engagement with the evidence. The two are not in competition. A church that ignores evidence is not being more faithful — it is being less careful.
What follows is written for churches in every situation: those that have women elders and would lose the option of electing more, those that have never had women elders and may see no immediate change, and those in between. Wherever your church sits, the question the survey is asking is the same: what are the impacts if the door is permanently closed?
<hr>
<h3>What you should know before you start</h3>The survey exists because changes to governance structures can create psychosocial risks — risks to the psychological health and safety of the people in our churches. Under NSW law, PCBUs must identify and manage reasonably foreseeable risks to health and safety, including psychosocial risks, in connection with proposed changes and on an ongoing basis, not only after changes have been implemented.
The PCNSW’s own WHS Manual acknowledges the denomination’s legal obligations as a Person Conducting a Business or Undertaking (PCBU) under the <em>Work Health and Safety Act 2011</em> (NSW). Volunteers, including elders, are legally defined as “workers” under this Act. The Manual opens by affirming both a theological duty of care (love your neighbour) and a legal duty of care (comply with WHS) as a single commitment. The denomination itself has linked these two frames. They cannot later be separated when it is convenient to do so.
Under section 48 of the Act, consultation about changes affecting health and safety requires that four elements are present: sharing relevant work health and safety information with workers; giving workers a reasonable opportunity to express their views and raise health or safety issues; giving workers a reasonable opportunity to contribute to the decision-making process relating to the health and safety matter; and taking workers’ views into account and advising them of the outcome in a timely manner. The survey includes free-text boxes and open questions, and the Consultative Committee has indicated it will summarise responses from the survey and other consultations. On the face of the survey and the public material, information-sharing is clear, but the wider consultation process is not yet transparent enough to assess how the remaining section 48 elements will be met. It is possible that further consultation steps are being undertaken by the Committee that are not yet publicly visible — this resource can only assess what is in the public record. SafeWork NSW’s guidance is clear that there is no set way consultation must occur — but it must be genuine, and workers must have a reasonable opportunity to express views and contribute to decisions.
In September 2025, SafeWork NSW issued a prohibition notice requiring the University of Technology Sydney to halt a redundancy process, not because redundancies are unlawful in themselves, but because the <em>way the process was being managed</em> created a serious and imminent risk of psychological harm. This appears to be the first publicly reported instance of SafeWork NSW using a prohibition notice to stop a redundancy or restructuring process on psychosocial safety grounds.
Why does this matter for your church? Professor Tuckey’s research with a major Australian retailer showed that when organisations change the structures that govern people’s working lives — reporting lines, authority, who has a voice and who doesn’t — the way they manage that change determines whether people are protected or harmed. When the retailer redesigned its conditions through structured risk surveys, co-design with affected workers, and measured follow-up, bullying dropped. When institutions skip those steps, the process itself can become the source of harm. The UTS notice shows that regulators will now treat poorly managed change processes as psychosocial hazards in their own right. The proposed change to the PCNSW Code would alter who governs, who is heard, and who has formal standing in every congregation. That is exactly the kind of structural change that requires careful management. If it proceeds without adequate risk assessment, without genuine consultation, and without safeguards in place, the people most likely to bear the cost are the women who serve faithfully, the volunteers who raise concerns, and the children whose safety depends on governance that listens.
<strong>Who carries the legal duty?</strong> The PCNSW’s own WHS Guidelines state that each congregation is a PCBU, and that “much of this responsibility falls on the Committee of Management.” Officers of a PCBU must exercise “due diligence” to ensure compliance. The Guidelines are explicit: “It is not a defence to claim, ‘We can’t do anything about it’ or ‘We’ve been doing it this way for years.'” Under the Act, an officer is anyone who makes, or participates in making, decisions that affect the whole or a substantial part of the undertaking — which in the context of a congregation includes elders, ministers, and members of the committee of management. The decision on whether to change the Code rests with the members of the Assembly and, under the Barrier Act, with the members of presbyteries who must ratify it. But every officer at the local level has a duty of care for the safety of the people in their congregation during the consultation process that is happening now.
<blockquote><strong>Questions worth sitting with</strong>
<ul>
<li>Does your session know about these legal obligations? Have they discussed them with your congregation?</li>
<li>Has anyone in your church been given the opportunity to raise issues about this proposed change that go beyond the preset questions in this survey?</li>
<li>If you have concerns that aren’t captured by the survey questions, how would you raise them?</li>
<li>Have you considered what governance, due process, and complaint pathways will need to be in place at the local level if this change proceeds, and what psychosocial risks those processes may create for ministers, elders, staff, and volunteers?</li>
<li>Have you thought about what psychosocial hazards might arise in your own church from the way this debate and survey are conducted — for example, conflict, exclusion, distress, loss of trust, or fear of speaking up?</li>
<li>Have you ensured that people in your church can participate in the survey with enough information to understand potential impacts, and with genuine freedom to answer honestly?</li>
<li>Before this change proceeds, have you, in your role as an elder or leader, been meaningfully consulted about the WHS implications you will be responsible for managing — and have you had the chance to raise your own concerns?</li>
</ul>
</blockquote>
<hr>
<h3>Section 2: Church Environment and Culture</h3>
<h4>Survey Q1: “If women are unable to serve as elders, do you feel that quality of leadership will be affected?”</h4><strong>Evidence:</strong> Both sides of the theological debate in the PCNSW acknowledge that governance bodies comprising men and women working together show stronger governance inputs — better monitoring, greater accountability, and more rigorous challenge. The Elders and Deacons Committee’s own 2021 report cited this evidence. Research on corporate and nonprofit boards finds that gender-diverse boards show stronger monitoring, better attendance, and greater accountability (Adams & Ferreira, 2009; Buse, Bernstein & Bilimoria, 2016). Irving Janis’s foundational work on groupthink identified homogeneity of membership as a structural fault that makes groups stop questioning themselves.
<strong>An important nuance:</strong> Adams and Ferreira found that gender diversity’s effect depends on existing governance quality. In organisations that were already well-governed — with strong accountability, active oversight, and robust challenge — adding diversity provided diminishing returns and could even reduce performance through over-monitoring. But in poorly-governed organisations, where scrutiny was weak and challenge was absent, gender diversity added significant value. An ANU review confirmed that the average effect across all organisations is close to zero — but the governance-specific benefits (improved scrutiny, stronger challenge, greater sensitivity to ethical risk) are robust. The question for the PCNSW is: are all our sessions in the “well-governed” category? The denomination has never measured this.
<blockquote><strong>Questions worth sitting with</strong>
<ul>
<li>If your church has women elders: has there been a time when a woman elder brought a perspective or noticed something that might have been missed by an all-male group? What would be lost if that voice were no longer at the table?</li>
<li>If your session is already all-male: how are women’s perspectives currently represented in governance decisions? Is that arrangement formal or informal, and does it carry real weight? Could your congregation be strengthened if a gifted woman were able to serve as an elder in the future?</li>
<li>The research suggests that sessions which already consult women effectively, handle complaints well, and take safeguarding seriously may not need the structural safeguard of women elders to achieve good governance outcomes. But sessions where consultation is thin, complaints are handled informally, and the minister’s authority goes largely unchecked are exactly where diverse governance adds the most value. The proposed change applies to both. Which category is your session in — and how would you know?</li>
<li>The current Code preserves the freedom of each congregation to discern whether women should serve as elders. The proposed change removes that freedom for every church in the denomination. Even if your church has not exercised that freedom, is there value in having it?</li>
<li>Presbyterian governance was built on plurality and accountability — the conviction that unchecked authority corrupts. The Westminster divines established the ruling elder as a representative of the whole congregation. If ruling elders represent the congregation, and the congregation includes women, what does it mean for that representation when women cannot serve?</li>
</ul>
</blockquote>
<h4>Survey Q2: “Do you believe this proposed change could impact the church’s culture?”</h4><strong>Evidence:</strong> The Australian Royal Commission into Institutional Responses to Child Sexual Abuse (2017) found that the absence or insufficient involvement of women in leadership and governance in religious institutions negatively affected decision-making and accountability, and may have contributed to inadequate responses to child sexual abuse. The Commission identified clericalism — the elevation of ordained or ministerial status above laity — as a significant contributing factor. Recommendation 16.37 called for child-safety advisory mechanisms that include lay men and women with relevant expertise.
<strong>Organisational climate research:</strong> Professor Maureen Dollard’s Psychosocial Safety Climate (PSC) framework — developed at the University of South Australia and studied across 38 research studies internationally (Amoadu et al., 2024) — demonstrates that the policies, practices, and procedures set by senior leadership determine the psychological safety of everyone underneath them. PSC is what researchers call the “cause of the causes”: when it is low, bullying, harassment, and burnout follow predictably. Governance changes that concentrate authority and reduce the diversity of voices at the top directly affect an organisation’s psychosocial safety climate. This is not a corporate abstraction — it describes the conditions that shape whether people in your church feel safe to speak, to raise concerns, and to trust that leadership will respond.
<strong>Experience of comparable churches:</strong> The <a href="https://www.savethepca.com/ffo/">Save the PCA “Functional Female Officer Report”</a> (2025) surveyed 1,964 churches in the Presbyterian Church in America (not to be confused with the Presbyterian Church of Australia) — a denomination that has never ordained women as ruling elders — and found 9.5% had some form of functional female officer: 4.0% with women performing elder-like functions and 6.0% with women performing deacon-like functions. Practices included leading worship elements, sitting in session meetings as “elder advisors,” going through officer training, and being commissioned with public vows — all without formal office. Male-only ordination, in practice, can generate pressure toward informal workarounds that sit uneasily with the denomination’s own polity.
<strong>The Australian experience:</strong> Five of Australia’s six state Presbyterian churches moved to male-only eldership between 1984 and the early 2000s. Queensland was first in 1984; Victoria followed in 1997–98; Tasmania, Western Australia, and South Australia followed shortly after. NSW remains the sole holdout. None of these state churches conducted a baseline assessment before the change, measured impact during the transition, or evaluated outcomes afterward. Alternative participation mechanisms were created — women’s ministries committees, advisory groups, “fixed orders of the day” allowing women to comment but not vote at meetings — but none has been independently evaluated for effectiveness.
<strong>Our own denomination’s data:</strong> The <a href="https://www.wmpca.org.au/2023-gaa-paper">Women’s Ministry Committee of the Presbyterian Church of Australia</a> conducted a survey across the PCA in 2020 (484 respondents: 324 women, 162 men) and presented its findings to the 2023 GAA. The results came from our own churches. More than 50% of respondents could not affirm that women trust the elders with issues specifically related to women such as domestic violence and sexism. Fewer than 50% of women believed they were consulted by elders regarding church direction. More than 30% of women felt limited in what they could do in church. On every major question, there were statistically significant differences between how men and women perceived the same reality — a communication gap the WMPCA report itself identified as needing urgent attention.
<blockquote><strong>Questions worth sitting with</strong>
<ul>
<li>The PCNSW has permitted women ruling elders since the 1967 GAA, though most congregations have not elected them. For churches that have: what would change in your culture if no new women could be ordained? For churches that haven’t: does the fact that the permission <em>exists</em> shape your culture in ways you might not notice until it is gone?</li>
<li>If the change proceeds, how would your church ensure that women’s voices continue to shape governance decisions? Would that mechanism be formal or informal? Would it carry voting power, or only advisory status? Is there a woman in your congregation who, under different circumstances, might be called to serve as an elder? What would it mean to her — and to your church — if that call could never be tested?</li>
<li>The Royal Commission examined what happens in religious institutions when governance becomes exclusively male. You may not think your church is at risk of those failures. But Reformed theology teaches us to build structures for the reality of sin, not the aspiration of godliness. What structures would your church need if this change proceeds?</li>
<li>Have you seen examples — in your own experience or in other churches — where the presence or absence of women in leadership affected the culture or decision-making of the community?</li>
</ul>
</blockquote>
<h4>Survey Q3: “Do you believe the proposed change could have an impact on interpersonal relationships or team dynamics in your local church?”</h4><strong>Evidence:</strong> Research on volunteer organisations finds that when people perceive they have been excluded from decision-making or that their role has been diminished, their engagement drops and relationships within the team become strained (Allen & Mueller, 2013). This is not about theology — it is about how human beings respond when the terms of their participation change. In a local church, the proposed change would alter the relationship between women who serve and the governance structures they serve under. Women who have been elders, or who aspired to be, would need to find a new understanding of their place. Men who valued their female colleagues on session would lose those working relationships. The dynamics affect everyone, not just those directly excluded.
<strong>Experience of comparable churches:</strong> When the Christian Reformed Church opened all offices to women in 1995, 36 churches with approximately 7,500 members left the denomination. Governance changes on this question fracture communities in both directions. In churches where relationships are strong and consultation is genuine, the change may be absorbed without visible damage. But in churches where relationships are already strained, where women feel undervalued, or where the debate has been conducted without listening to the people most affected, the formal act of closing the door can crystallise tensions that were previously manageable. The proposed change applies uniformly to every congregation regardless of its relational health.
<blockquote><strong>Questions worth sitting with</strong>
<ul>
<li>Think about the women in your congregation who currently serve in any form of leadership — as elders, as ministry workers, as volunteers who carry real responsibility. How would they experience this change? Have you asked them?</li>
<li>If your church has a woman elder: what is it like for her to serve on a session alongside colleagues who may have supported the overture to end her office? What does that dynamic do to the working relationship, the trust, and the ability to govern together during the transition?</li>
<li>Are there people in your church who might leave if this change is made? Are there people who might leave if it isn’t? What would either departure mean for your community?</li>
<li>Even in churches without women elders, the <em>permission</em> for women to serve sends a signal about how the church values women’s gifts. Removing that permission also sends a signal. How would the women and girls in your church receive it?</li>
<li>Paul writes in 1 Corinthians 12 that the body cannot say to any member, “I have no need of you.” How does your church currently say to its women, “We need you”? Would that change?</li>
</ul>
</blockquote>
<hr>
<h3>Section 3: Wellbeing and Safety</h3>
<h4>Survey Q1: “Do you believe the proposed change will impact on your emotional or psychological wellbeing?”</h4><strong>Evidence:</strong> Research on volunteers consistently finds that burnout and disengagement are driven by role ambiguity, lack of voice, and exclusion from decision-making (Allen & Mueller, 2013). Psychosocial safety research shows that harm from organisational change often emerges over time rather than immediately. This is relevant here: for some women, the impact of this change may not be felt at the point of announcement but later — as the change is experienced in practice, in session meetings, in pastoral care decisions, in the gradual realisation that a door has been permanently closed. People respond differently to institutional change; the survey is asking about <em>your</em> response, and it is worth being honest about what that might be over time, not just right now.
<blockquote><strong>Questions worth sitting with</strong>
<ul>
<li>This question asks about <em>your</em> wellbeing. Take a moment to sit honestly with that. Not what you think the right answer is. Not what you think your church expects you to feel. What do you actually feel when you imagine this change taking effect?</li>
<li>If you are a woman: does this change affect your sense of belonging in your church? Your sense that your contribution is valued? Your willingness to serve?</li>
<li>If you are a woman currently serving as an elder: how does it feel to know that your denomination is considering a change that says no other woman should hold the role you hold? How will it affect your experience of serving in a role that your denomination has decided should not exist for anyone who comes after you? Has anyone asked you?</li>
<li>If you are a man: how do you think the women in your church would answer those questions? Have you asked them directly?</li>
<li>Genesis 3:16 describes a distortion of the relationship between men and women as a consequence of the Fall. Complementarian theologians including John Piper and Al Mohler acknowledge that male authority, post-Fall, tends toward sinful domination. If that’s true — and it’s in our own literature — what governance structures guard against that tendency? Does this change strengthen or weaken them?</li>
</ul>
</blockquote>
<h4>Survey Q2: “Do you currently feel safe and supported in your local church environment?”</h4><strong>Evidence:</strong> The PCNSW’s <a href="https://breakingthesilence.org.au/">Breaking the Silence</a> program has been in place since 1997 and is mandatory for elders, ministers, and those working with children or vulnerable people. BTS is more developed than many people realise. Foundations training runs on a three-year cycle with annual Read and Review refreshers for those working with children. The CPU conducts compliance audits. The online training requires an 80% quiz score for satisfactory completion. This is real work done by serious people.
<strong>The harder question</strong> is not whether BTS exists, but what public evidence is available that it changes reporting behaviour and outcomes over time. The quiz is a knowledge check — it tests whether you <em>know</em> what to do, not whether you <em>can</em> do it. It does not assess the ability to actually handle a disclosure, manage a conflict of interest, or respond to a vulnerable person in distress. In 28 years, no public independent evaluation of BTS effectiveness or published outcome data has been identified. The system may work well — the training, audit, and complaint-handling machinery is real. But no public independent outcome evaluation has been identified showing whether BTS improves reporting behaviour and outcomes over time, and that gap matters if we are about to remove another layer of structural accountability.
A 2025 peer-reviewed study in <em>Child Abuse & Neglect</em> (Hunt, Higgins & Willis) interviewed 20 Christian leaders across denominations about safeguarding training and found a compliance-focused mentality rather than genuine culture change, with cultural resistance from some leaders who see safeguarding as unnecessary external regulation.
<strong>What our own denomination’s data says:</strong> The PCA’s 2020 Women’s Ministry Committee survey asked questions directly relevant to whether people feel safe and supported. More than 50% of respondents could not affirm that women who experience domestic violence know who to talk to about it within the church. More than 50% could not affirm that women trust the elders with issues specifically related to women. There were statistically significant differences between how female congregation members and male ministers answered the question of whether women trust elders with these issues. If that gap exists now — in a denomination where women can still serve as elders — what happens when that possibility is removed?
<blockquote><strong>Questions worth sitting with</strong>
<ul>
<li>This is a baseline question. Answer it for how things are <em>now</em>, not how they might change. The honesty of this answer matters because it tells the Assembly what the starting point is.</li>
<li>If a woman in your church experienced harm from a male leader — bullying, harassment, spiritual abuse — where would she go? Who would she trust? Is that pathway clear, accessible, and safe?</li>
<li>Has your session ever discussed what would happen if someone made a complaint? Not in theory. Practically. Who handles it? What process is followed?</li>
<li>Do you know whether your elders and ministry workers have completed Breaking the Silence training? Could you find out?</li>
</ul>
</blockquote>
<h4>Survey Q3: “Do you believe the proposed change could influence the amount of conflict, bullying, or harassment?”</h4><strong>Evidence:</strong> Bullying and harassment in organisations are not primarily caused by bad individuals. Professor Michelle Tuckey’s research at the University of Adelaide has shown that the majority traces to how an organisation is structured — its governance, its reporting lines, who holds authority, and who can challenge it. In plain terms: when your church changes who sits on session, who gets a vote, and who has formal standing, it is changing the very structures that research shows either prevent or produce harm. Tuckey’s <a href="https://www.bridgesatwork.au/">BRIDGES at Work</a> framework identifies formal organising arrangements as one of four subsystems that determine whether an organisation is safe. The proposed change directly alters those arrangements in every congregation.
<strong>What this looks like in churches:</strong> Research consistently shows that women in church settings already experience higher rates of burnout and lower confidence that leadership will respond to their concerns. Emerging evidence suggests women may be disproportionately affected when the organisational climate around psychological safety is poor (Amoadu et al., 2024). If the conditions are already difficult for women in churches, a governance change that reduces their formal voice does not make those conditions better.
<strong>The experience of comparable churches:</strong> These are not distant examples. The Royal Commission found that in some religious institutions — institutions led by people who believed they were serving God faithfully — the absence of women in governance negatively affected accountability in child-safety contexts. The Southern Baptist Convention’s Guidepost report (2022) documented over 700 accused abusers under exclusively male governance. The Presbyterian Church in America’s 2023 General Assembly — an all-male body — voted down all four abuse-prevention proposals put before it. The question is not whether our people would do the same. It is what structures we have in place to make sure they don’t have to.
<blockquote><strong>Questions worth sitting with</strong>
<ul>
<li>Has your church experienced conflict, bullying, or harassment? If so, how was it handled? Would the proposed change make that handling better or worse?</li>
<li>The Reformed tradition teaches total depravity — not that every person is as bad as they could be, but that sin touches every part of human life, including the exercise of authority. We build structures — plurality of elders, higher courts, the right of appeal — because we know that good intentions are not a sufficient safeguard. Does this change add accountability or remove it?</li>
<li>The Royal Commission heard from more than 4,000 survivors of abuse in religious institutions. Its findings about male-only governance are not theoretical. They are based on evidence from institutions that believed they were serving God faithfully. How should those findings inform your answer?</li>
<li>Think about specific scenarios: a woman experiencing domestic violence, a child safety concern raised by a female volunteer, a complaint about a male leader’s conduct. Under the proposed arrangement, who hears these? Who decides what happens next?</li>
</ul>
</blockquote>
<h4>Survey Q4: “Do you believe that this could impact the willingness of staff or volunteers to report issues such as conflict, bullying, or harassment?”</h4><strong>Evidence:</strong> The Presbyterian Church in America’s Ad Interim Study Committee on Domestic Abuse and Sexual Assault (2022) documented significant concerns about women’s ability to trust male-only elder bodies with issues like domestic violence and sexism. Advisory member Ann Maree Goudzwaard noted that substantial work remains before women in the American PCA can have assurance their case will be shepherded well. Murray Capill, Dean of Ministry Development at the Reformed Theological College in Melbourne, acknowledged that “from my experience and observation, lots of us don’t do that well” at consulting women — and this in a denomination that already has male-only eldership.
<strong>Governance research:</strong> There is a reasonable concern, supported by governance and institutional abuse research, that when the people receiving complaints share the demographic profile of the people being complained about, reporting confidence is affected. When there is no one on the governance body who shares the complainant’s experience, trust may diminish. The PCA (Australia)’s own 2020 survey data — where more than half of women could not affirm trust in elders with women-specific issues — is consistent with this concern.
<blockquote><strong>Questions worth sitting with</strong>
<ul>
<li>If you needed to report a concern about a male leader’s conduct, would you feel comfortable bringing it to an all-male session? Would the women in your church?</li>
<li>Is there currently a woman in your church’s governance who would be a first point of contact for women with concerns? If the proposed change proceeds, who fills that role? What standing would they have?</li>
<li>Craig Tucker’s paper notes that dealing with complaints by women against male leaders is “particularly problematic when the case is heard by a group in which only men get a vote.” Do you agree? Does your church have any mechanism to address this problem?</li>
<li>The survey itself directs people who are feeling impacted to “reach out to your Minister, Elders or leadership.” For some respondents, these are the very people they find it hardest to approach. Does your church have an alternative pathway?</li>
</ul>
</blockquote>
<hr>
<h3>Section 4: Support and Recommendations</h3>
<h4>Survey Q1: “What additional support or resources do you believe are needed to cultivate a safe and healthy environment if the proposed change is made?”</h4>This is an open question, and it is your opportunity to name specific, concrete measures rather than general sentiments. The evidence suggests several areas that would need to be addressed if the change proceeds. You may find these helpful as starting points for your own answer:
<strong>From advisory models to demonstrated safeguards.</strong> Alternative models for women’s participation have been proposed and in some cases Assembly-endorsed. The <a href="https://www.pcnswwomen.org.au/blog/assembly-resolution">GANSW 2022 resolution</a> encouraged sessions to establish women’s advisory groups, appoint women to “co-worker” positions with associate-style privileges, include women in complaints and disciplinary processes, and appointed a Women’s Engagement Working Group. The national <a href="https://www.wmpca.org.au/2023-gaa-paper">WMPCA deliverances</a> (2023 GAA) went further with similar recommendations at presbytery and Assembly level. This is real work. But the mechanisms are advisory (“encourage each Session to consider”), not binding. Uptake has been slow — a growing number of churches have established women’s advisory groups, but implementation is uneven. No binding, denomination-wide, independently evaluated mechanism has been demonstrated as an effective safeguard alongside this proposed change. The model on offer is partial, and its effectiveness has not been tested. The experience of other denominations suggests that advisory and associate-style mechanisms, without formal standing, often do not deliver the voice they promise.
<strong>Independent evaluation of safeguarding training.</strong> BTS has an 80% knowledge quiz and the CPU conducts compliance audits — but after 28 years, the denomination should also be able to demonstrate publicly that its training changes behaviour and outcomes. Competency-based assessment (testing the ability to <em>handle</em> a disclosure, not just <em>know about</em> one), independent external evaluation, and published outcome data would allow the denomination to know — in a measurable sense — whether its safeguarding system works.
<strong>An adequately independent complaints pathway.</strong> The PCNSW does have a Conduct Protocol Unit and a contact-person pathway intended to provide independence from the local church. The live question is whether that pathway is sufficiently independent from the denomination itself, sufficiently visible to women and volunteers, and sufficiently trusted — particularly for complaints about male leaders heard by all-male bodies.
<strong>Measurable implementation of Healthy Complementarianism.</strong> The GANSW 2022 resolution encouraged sessions to establish women’s advisory groups, appoint women to “co-worker” positions, and include women in complaints and disciplinary processes, and appointed a Women’s Engagement Working Group to bring further recommendations. At the national level, the PCA’s Women’s Ministry Committee presented a detailed strategy to the 2023 GAA with 15 deliverances along similar lines — relevant context, though national deliverances do not bind the PCNSW. Dr Murray Smith, one of the architects of the overture, has acknowledged that if the church says ‘no’ to women in eldership without also saying ‘yes’ to all the ways men and women complement each other, it has only done half the job. Whether the GANSW’s own 2022 encouragements have been implemented at session level is the question the NSW Assembly should be able to answer before voting on the overture.
<strong>Learning from complementarian denominations that have tried.</strong> The Presbyterian Church in America’s DASA report (2022) is the most comprehensive safeguarding framework produced by any complementarian denomination for all-male governance. Its 220 pages cover abuse response, whistleblower protections, and mandatory background checks. But the report is non-binding, and efforts to implement even its background check recommendations have met resistance. Nowhere in the Reformed world has a denomination produced binding structural safeguards specifically designed for all-male governance. The gap between complementarian theology — which teaches that authority should protect the vulnerable — and complementarian institutional design remains vast.
<blockquote><strong>Questions worth sitting with</strong>
<ul>
<li>Which of these measures does your church currently have in place? Which are absent?</li>
<li>Should any of these be required <em>before</em> the change takes effect, or are you comfortable with them being developed after?</li>
<li>Even if your church does not currently have women elders, the proposed change closes the door for every congregation, including those where women’s gifts in governance might be most needed — small country parishes, churches in vacancy, congregations where the work of eldership falls on very few shoulders. What support would those churches need?</li>
<li>The Reformers insisted on building structures before granting authority. Knox and Calvin did not consolidate power and then design accountability later. They built the accountability first. What does that principle suggest about the sequencing of this change?</li>
</ul>
</blockquote>
<h4>Survey Q2: “Do you have any recommendations for your local church and denominational leaders regarding health and safety as it relates to diversity and inclusion?”</h4>
<blockquote><strong>Questions worth sitting with</strong>
<ul>
<li>What would you need to see from your session and presbytery to have confidence that this change — if it proceeds — will not diminish the safety and wellbeing of women in your church?</li>
<li>Are there practical steps your church could take now, regardless of the outcome of this vote, to strengthen the voice and safety of women in your community?</li>
<li>What would “doing this well” look like, even if you support the change? What would “doing this badly” look like?</li>
<li>If your church has never had women elders, have you considered why? It may be because your session holds a theological conviction that eldership should be male-only. It may be because the congregation discerned it was not right for your context at this time. Or it may be because the question was never asked. Each of these is a different starting point, and the proposed change affects each differently — the first formalises an existing conviction, the second removes a future option, and the third closes a door that was never opened.</li>
</ul>
</blockquote>
<hr>
<h3>Section 5: Final Comments</h3>The survey gives you an open field. If the structured questions did not capture what you most want to say, say it here. Some things worth considering:
<blockquote><strong>Questions worth sitting with</strong>
<ul>
<li>Is there something about this issue that keeps you awake at night? Something you haven’t been able to say to anyone in your church? This is the place for it.</li>
<li>If you are uncertain about the theological question, you are allowed to say so. The survey is not asking you to resolve the exegesis. It is asking what happens to real people in real churches if this change is made.</li>
<li>If you support the change but have concerns about how it is being implemented, this is the place to name those concerns. Supporting a theological position and questioning the process are not contradictory.</li>
<li>If your church has never had women elders, the proposed change might feel like it changes nothing for you. But consider: is there a difference between a church that has <em>chosen</em> its current arrangement and a church that has had the choice <em>removed</em>? What does that difference mean for the kind of denomination we want to be?</li>
<li>If you have experienced harm in a church setting — from male authority exercised without accountability, from complaints that went nowhere, from being told your concerns didn’t matter — your experience is evidence. It matters. You are not required to share it, but if you choose to, it will inform the Assembly’s understanding of what is at stake.</li>
</ul>
</blockquote>
<hr>
<h3>Section 6: Contact and Demographic Information</h3>The survey’s contact details (name and email) are optional. However, the demographic and identifying questions earlier in the survey — your church name, your role, your gender, whether your church has female elders — are not marked as optional.
The committee has stated that when presenting results to the Assembly, it will aggregate the data so that no individual responses can be identified by anyone outside the survey process, including by the Assembly itself.
It is still worth being aware, however, that in smaller congregations a combination of church name, role, and gender may be enough to make a respondent identifiable within the survey data itself — even without their name attached. If the survey is asking whether you trust the current governance structures to keep you safe, and your answers could be connected to you by those with access to the raw data, that may affect how freely you respond. This is not a reason not to complete the survey — your voice matters. But it is worth knowing.
The survey notes that “certain State or Federal legislation may require the denomination to share these details.” This is a standard data-handling disclosure. If you have concerns about confidentiality, you may choose not to provide your name or email while still completing the substantive sections.
If you are feeling particularly impacted by this issue, <a href="https://jerichoroad.org.au/counselling-service/">Jericho Road</a> provides access to counselling services. You do not need to go through your minister or elders to access this.
<hr>
<h3>A note on the consultation process</h3>The Assembly has charged sessions with responsibility for engaging in this consultation properly. That charge should be taken seriously. If your session has not discussed this survey with your congregation, has not made time for conversation about its questions, or has not explained how your responses will be used, you may wish to raise that directly.
Consultation that meets the legal and moral standard our denomination has set for itself requires more than distributing a link. It requires making space for people to be heard, especially those who may find it hardest to speak. In some churches, that will mean actively reaching out to women, to newer members, to those who serve faithfully but have never been asked what they think about how the church is governed.
<div style="background: #eaf0f6; padding: 14px 18px; border-radius: 4px; font-size: 1.05em; margin-top: 1.5em"><strong>The survey closes 25 May 2026.</strong> <a href="https://www.surveymonkey.com/r/WHSEldershipSurvey2026">surveymonkey.com/r/WHSEldershipSurvey2026</a></div>
<em>This document is licensed under Creative Commons Attribution 4.0 International (CC BY 4.0). You are free to share and adapt this material for any purpose, provided you give appropriate credit.</em>
The PCNSW is consulting on a proposed change to restrict future eldership to men.
The consultation is being run as a Work Health and Safety process, with a survey that closes on 25 May. If you’ve been sent the link and weren’t sure how to engage with it, this post is for you.
I respond to legislative and regulatory consultations regularly, and I sit on committees that do the same. When I read a consultation paper, I need to understand what I’m being asked and why. What does the evidence say? What are the risks? What are the alternatives?
The PCNSW survey asks important questions, but it doesn’t provide any explanatory context tohelp you answer the questions well. So I went looking for that context. This is what I found.
A few things worth saying upfront.
This isn’t an attempt to argue the theological question of whether Scripture requires male-only eldership. That conversation is being had elsewhere, by people more qualified than me to have it. What this does ask is whether the process, the safeguards, and the evidence are adequate for a change of this magnitude. Those are governance questions, not theological ones, and they deserve honest engagement with the evidence we have. It’s also not a committee document. I’m a member and lay leader at my own church. I don’t have standing in the Assembly, and this hasn’t been produced in consultation with the WHS Consultative Committee or any denominational body.
What I do bring is experience in governance across business, not-for-profit, and community organisations, and time spent working on child safety policy. That work has shaped how I think about authority, accountability, and what happens when institutions get the structures wrong.
One thing readers should know: the evidence on the questions this survey asks runs predominantly in one direction — toward caution about the proposed change. That’s not because I’ve been selective. It’s because the evidence we’d need to feel confident about this isn’t there.
No public systematic evaluation has been identified showing what happened after any denomination restricted or removed women from governance.
No baseline assessment, no transition measurement, no outcome evaluation has been published for any Australian state church that moved to male-only eldership.
No binding structural safeguards specifically designed for all-male governance have been identified in any complementarian denomination, anywhere.
The alternative participation mechanisms that exist — advisory committees, women’s ministry groups, “co-worker” roles — don’t appear to have been independently evaluated.
That evidence gap is itself worth weighing. It is usual to see that institutions that are confident the change had worked well have measured it.
Where things get more complicated is in our own context. Most PCNSW congregations don’t currently have women serving as elders. In many churches, sessions have been all-male for years or always. The Code permits women to serve as ruling elders — and has done since the 1967 GAA — but most churches have not exercised that permission.
For some congregations, the proposed change would formalise what is already the case. For the small number of churches that do have women elders, current women elders would continue to serve in that role, but no new women could be elected. The door closes permanently for everyone.
There’s a real difference, though, between a church that has chosen its current arrangement and a church that is prohibited from doing otherwise. The current Code preserves the freedom of each congregation to discern, under the guidance of Scripture and the Spirit. The proposed change removes that freedom denomination-wide. Even for churches that haven’t exercised that freedom, that’s a different thing.
What follows is structured around the survey itself. Each section gives you some evidence — research, the Royal Commission, the experience of comparable denominations, our own data — and a set of reflective questions designed to help you think through your own answer.
A note before you start: this covers a lot of ground. You don’t need to read it all at once, and you don’t need to do the survey straight away. Read a section. Sit with it. Talk about it with someone — a friend, your home group, your spouse, your minister. The questions are sharper when you’ve heard how someone else answers them. If your session is willing to host a conversation, this could serve as a starting point.
With thanks to Valerie Ling (Centre for Effective Serving), a registered clinical psychologist and supervisor, for sharing her clinical and doctoral research into psychosocial safety, governance, and gender in church contexts which helped inform the framing of the WHS evidence here.
A note on how to read this
We’re a Reformed church. We believe Scripture is the supreme authority for faith and practice. We also believe, as the Westminster Confession teaches, that God has given us reason, conscience, and the experience of the church across the ages as gifts for understanding how to apply Scripture wisely. The Reformers built Presbyterian governance on the conviction that all human authority is fallible and must be accountable. They insisted on plurality of elders, parity between ministers and ruling elders, and the right of appeal to higher courts — precisely because they took total depravity seriously as a reality that shapes how institutions must be designed.
The questions in this survey are governance questions. They deserve both spiritual discernment and honest engagement with the evidence. The two are not in competition. A church that ignores evidence is not being more faithful — it is being less careful.
What follows is written for churches in every situation: those that have women elders and would lose the option of electing more, those that have never had women elders and may see no immediate change, and those in between. Wherever your church sits, the question the survey is asking is the same: what are the impacts if the door is permanently closed?
What you should know before you start
The survey exists because changes to governance structures can create psychosocial risks — risks to the psychological health and safety of the people in our churches. Under NSW law, PCBUs must identify and manage reasonably foreseeable risks to health and safety, including psychosocial risks, in connection with proposed changes and on an ongoing basis, not only after changes have been implemented.
The PCNSW’s own WHS Manual acknowledges the denomination’s legal obligations as a Person Conducting a Business or Undertaking (PCBU) under the Work Health and Safety Act 2011 (NSW). Volunteers, including elders, are legally defined as “workers” under this Act. The Manual opens by affirming both a theological duty of care (love your neighbour) and a legal duty of care (comply with WHS) as a single commitment. The denomination itself has linked these two frames. They cannot later be separated when it is convenient to do so.
Under section 48 of the Act, consultation about changes affecting health and safety requires that four elements are present: sharing relevant work health and safety information with workers; giving workers a reasonable opportunity to express their views and raise health or safety issues; giving workers a reasonable opportunity to contribute to the decision-making process relating to the health and safety matter; and taking workers’ views into account and advising them of the outcome in a timely manner. The survey includes free-text boxes and open questions, and the Consultative Committee has indicated it will summarise responses from the survey and other consultations. On the face of the survey and the public material, information-sharing is clear, but the wider consultation process is not yet transparent enough to assess how the remaining section 48 elements will be met. It is possible that further consultation steps are being undertaken by the Committee that are not yet publicly visible — this resource can only assess what is in the public record. SafeWork NSW’s guidance is clear that there is no set way consultation must occur — but it must be genuine, and workers must have a reasonable opportunity to express views and contribute to decisions.
In September 2025, SafeWork NSW issued a prohibition notice requiring the University of Technology Sydney to halt a redundancy process, not because redundancies are unlawful in themselves, but because the way the process was being managed created a serious and imminent risk of psychological harm. This appears to be the first publicly reported instance of SafeWork NSW using a prohibition notice to stop a redundancy or restructuring process on psychosocial safety grounds.
Why does this matter for your church? Professor Tuckey’s research with a major Australian retailer showed that when organisations change the structures that govern people’s working lives — reporting lines, authority, who has a voice and who doesn’t — the way they manage that change determines whether people are protected or harmed. When the retailer redesigned its conditions through structured risk surveys, co-design with affected workers, and measured follow-up, bullying dropped. When institutions skip those steps, the process itself can become the source of harm. The UTS notice shows that regulators will now treat poorly managed change processes as psychosocial hazards in their own right. The proposed change to the PCNSW Code would alter who governs, who is heard, and who has formal standing in every congregation. That is exactly the kind of structural change that requires careful management. If it proceeds without adequate risk assessment, without genuine consultation, and without safeguards in place, the people most likely to bear the cost are the women who serve faithfully, the volunteers who raise concerns, and the children whose safety depends on governance that listens.
Who carries the legal duty? The PCNSW’s own WHS Guidelines state that each congregation is a PCBU, and that “much of this responsibility falls on the Committee of Management.” Officers of a PCBU must exercise “due diligence” to ensure compliance. The Guidelines are explicit: “It is not a defence to claim, ‘We can’t do anything about it’ or ‘We’ve been doing it this way for years.'” Under the Act, an officer is anyone who makes, or participates in making, decisions that affect the whole or a substantial part of the undertaking — which in the context of a congregation includes elders, ministers, and members of the committee of management. The decision on whether to change the Code rests with the members of the Assembly and, under the Barrier Act, with the members of presbyteries who must ratify it. But every officer at the local level has a duty of care for the safety of the people in their congregation during the consultation process that is happening now.
Questions worth sitting with
Does your session know about these legal obligations? Have they discussed them with your congregation?
Has anyone in your church been given the opportunity to raise issues about this proposed change that go beyond the preset questions in this survey?
If you have concerns that aren’t captured by the survey questions, how would you raise them?
Have you considered what governance, due process, and complaint pathways will need to be in place at the local level if this change proceeds, and what psychosocial risks those processes may create for ministers, elders, staff, and volunteers?
Have you thought about what psychosocial hazards might arise in your own church from the way this debate and survey are conducted — for example, conflict, exclusion, distress, loss of trust, or fear of speaking up?
Have you ensured that people in your church can participate in the survey with enough information to understand potential impacts, and with genuine freedom to answer honestly?
Before this change proceeds, have you, in your role as an elder or leader, been meaningfully consulted about the WHS implications you will be responsible for managing — and have you had the chance to raise your own concerns?
Section 2: Church Environment and Culture
Survey Q1: “If women are unable to serve as elders, do you feel that quality of leadership will be affected?”
Evidence: Both sides of the theological debate in the PCNSW acknowledge that governance bodies comprising men and women working together show stronger governance inputs — better monitoring, greater accountability, and more rigorous challenge. The Elders and Deacons Committee’s own 2021 report cited this evidence. Research on corporate and nonprofit boards finds that gender-diverse boards show stronger monitoring, better attendance, and greater accountability (Adams & Ferreira, 2009; Buse, Bernstein & Bilimoria, 2016). Irving Janis’s foundational work on groupthink identified homogeneity of membership as a structural fault that makes groups stop questioning themselves.
An important nuance: Adams and Ferreira found that gender diversity’s effect depends on existing governance quality. In organisations that were already well-governed — with strong accountability, active oversight, and robust challenge — adding diversity provided diminishing returns and could even reduce performance through over-monitoring. But in poorly-governed organisations, where scrutiny was weak and challenge was absent, gender diversity added significant value. An ANU review confirmed that the average effect across all organisations is close to zero — but the governance-specific benefits (improved scrutiny, stronger challenge, greater sensitivity to ethical risk) are robust. The question for the PCNSW is: are all our sessions in the “well-governed” category? The denomination has never measured this.
Questions worth sitting with
If your church has women elders: has there been a time when a woman elder brought a perspective or noticed something that might have been missed by an all-male group? What would be lost if that voice were no longer at the table?
If your session is already all-male: how are women’s perspectives currently represented in governance decisions? Is that arrangement formal or informal, and does it carry real weight? Could your congregation be strengthened if a gifted woman were able to serve as an elder in the future?
The research suggests that sessions which already consult women effectively, handle complaints well, and take safeguarding seriously may not need the structural safeguard of women elders to achieve good governance outcomes. But sessions where consultation is thin, complaints are handled informally, and the minister’s authority goes largely unchecked are exactly where diverse governance adds the most value. The proposed change applies to both. Which category is your session in — and how would you know?
The current Code preserves the freedom of each congregation to discern whether women should serve as elders. The proposed change removes that freedom for every church in the denomination. Even if your church has not exercised that freedom, is there value in having it?
Presbyterian governance was built on plurality and accountability — the conviction that unchecked authority corrupts. The Westminster divines established the ruling elder as a representative of the whole congregation. If ruling elders represent the congregation, and the congregation includes women, what does it mean for that representation when women cannot serve?
Survey Q2: “Do you believe this proposed change could impact the church’s culture?”
Evidence: The Australian Royal Commission into Institutional Responses to Child Sexual Abuse (2017) found that the absence or insufficient involvement of women in leadership and governance in religious institutions negatively affected decision-making and accountability, and may have contributed to inadequate responses to child sexual abuse. The Commission identified clericalism — the elevation of ordained or ministerial status above laity — as a significant contributing factor. Recommendation 16.37 called for child-safety advisory mechanisms that include lay men and women with relevant expertise.
Organisational climate research: Professor Maureen Dollard’s Psychosocial Safety Climate (PSC) framework — developed at the University of South Australia and studied across 38 research studies internationally (Amoadu et al., 2024) — demonstrates that the policies, practices, and procedures set by senior leadership determine the psychological safety of everyone underneath them. PSC is what researchers call the “cause of the causes”: when it is low, bullying, harassment, and burnout follow predictably. Governance changes that concentrate authority and reduce the diversity of voices at the top directly affect an organisation’s psychosocial safety climate. This is not a corporate abstraction — it describes the conditions that shape whether people in your church feel safe to speak, to raise concerns, and to trust that leadership will respond.
Experience of comparable churches: The Save the PCA “Functional Female Officer Report” (2025) surveyed 1,964 churches in the Presbyterian Church in America (not to be confused with the Presbyterian Church of Australia) — a denomination that has never ordained women as ruling elders — and found 9.5% had some form of functional female officer: 4.0% with women performing elder-like functions and 6.0% with women performing deacon-like functions. Practices included leading worship elements, sitting in session meetings as “elder advisors,” going through officer training, and being commissioned with public vows — all without formal office. Male-only ordination, in practice, can generate pressure toward informal workarounds that sit uneasily with the denomination’s own polity.
The Australian experience: Five of Australia’s six state Presbyterian churches moved to male-only eldership between 1984 and the early 2000s. Queensland was first in 1984; Victoria followed in 1997–98; Tasmania, Western Australia, and South Australia followed shortly after. NSW remains the sole holdout. None of these state churches conducted a baseline assessment before the change, measured impact during the transition, or evaluated outcomes afterward. Alternative participation mechanisms were created — women’s ministries committees, advisory groups, “fixed orders of the day” allowing women to comment but not vote at meetings — but none has been independently evaluated for effectiveness.
Our own denomination’s data: The Women’s Ministry Committee of the Presbyterian Church of Australia conducted a survey across the PCA in 2020 (484 respondents: 324 women, 162 men) and presented its findings to the 2023 GAA. The results came from our own churches. More than 50% of respondents could not affirm that women trust the elders with issues specifically related to women such as domestic violence and sexism. Fewer than 50% of women believed they were consulted by elders regarding church direction. More than 30% of women felt limited in what they could do in church. On every major question, there were statistically significant differences between how men and women perceived the same reality — a communication gap the WMPCA report itself identified as needing urgent attention.
Questions worth sitting with
The PCNSW has permitted women ruling elders since the 1967 GAA, though most congregations have not elected them. For churches that have: what would change in your culture if no new women could be ordained? For churches that haven’t: does the fact that the permission exists shape your culture in ways you might not notice until it is gone?
If the change proceeds, how would your church ensure that women’s voices continue to shape governance decisions? Would that mechanism be formal or informal? Would it carry voting power, or only advisory status? Is there a woman in your congregation who, under different circumstances, might be called to serve as an elder? What would it mean to her — and to your church — if that call could never be tested?
The Royal Commission examined what happens in religious institutions when governance becomes exclusively male. You may not think your church is at risk of those failures. But Reformed theology teaches us to build structures for the reality of sin, not the aspiration of godliness. What structures would your church need if this change proceeds?
Have you seen examples — in your own experience or in other churches — where the presence or absence of women in leadership affected the culture or decision-making of the community?
Survey Q3: “Do you believe the proposed change could have an impact on interpersonal relationships or team dynamics in your local church?”
Evidence: Research on volunteer organisations finds that when people perceive they have been excluded from decision-making or that their role has been diminished, their engagement drops and relationships within the team become strained (Allen & Mueller, 2013). This is not about theology — it is about how human beings respond when the terms of their participation change. In a local church, the proposed change would alter the relationship between women who serve and the governance structures they serve under. Women who have been elders, or who aspired to be, would need to find a new understanding of their place. Men who valued their female colleagues on session would lose those working relationships. The dynamics affect everyone, not just those directly excluded.
Experience of comparable churches: When the Christian Reformed Church opened all offices to women in 1995, 36 churches with approximately 7,500 members left the denomination. Governance changes on this question fracture communities in both directions. In churches where relationships are strong and consultation is genuine, the change may be absorbed without visible damage. But in churches where relationships are already strained, where women feel undervalued, or where the debate has been conducted without listening to the people most affected, the formal act of closing the door can crystallise tensions that were previously manageable. The proposed change applies uniformly to every congregation regardless of its relational health.
Questions worth sitting with
Think about the women in your congregation who currently serve in any form of leadership — as elders, as ministry workers, as volunteers who carry real responsibility. How would they experience this change? Have you asked them?
If your church has a woman elder: what is it like for her to serve on a session alongside colleagues who may have supported the overture to end her office? What does that dynamic do to the working relationship, the trust, and the ability to govern together during the transition?
Are there people in your church who might leave if this change is made? Are there people who might leave if it isn’t? What would either departure mean for your community?
Even in churches without women elders, the permission for women to serve sends a signal about how the church values women’s gifts. Removing that permission also sends a signal. How would the women and girls in your church receive it?
Paul writes in 1 Corinthians 12 that the body cannot say to any member, “I have no need of you.” How does your church currently say to its women, “We need you”? Would that change?
Section 3: Wellbeing and Safety
Survey Q1: “Do you believe the proposed change will impact on your emotional or psychological wellbeing?”
Evidence: Research on volunteers consistently finds that burnout and disengagement are driven by role ambiguity, lack of voice, and exclusion from decision-making (Allen & Mueller, 2013). Psychosocial safety research shows that harm from organisational change often emerges over time rather than immediately. This is relevant here: for some women, the impact of this change may not be felt at the point of announcement but later — as the change is experienced in practice, in session meetings, in pastoral care decisions, in the gradual realisation that a door has been permanently closed. People respond differently to institutional change; the survey is asking about your response, and it is worth being honest about what that might be over time, not just right now.
Questions worth sitting with
This question asks about your wellbeing. Take a moment to sit honestly with that. Not what you think the right answer is. Not what you think your church expects you to feel. What do you actually feel when you imagine this change taking effect?
If you are a woman: does this change affect your sense of belonging in your church? Your sense that your contribution is valued? Your willingness to serve?
If you are a woman currently serving as an elder: how does it feel to know that your denomination is considering a change that says no other woman should hold the role you hold? How will it affect your experience of serving in a role that your denomination has decided should not exist for anyone who comes after you? Has anyone asked you?
If you are a man: how do you think the women in your church would answer those questions? Have you asked them directly?
Genesis 3:16 describes a distortion of the relationship between men and women as a consequence of the Fall. Complementarian theologians including John Piper and Al Mohler acknowledge that male authority, post-Fall, tends toward sinful domination. If that’s true — and it’s in our own literature — what governance structures guard against that tendency? Does this change strengthen or weaken them?
Survey Q2: “Do you currently feel safe and supported in your local church environment?”
Evidence: The PCNSW’s Breaking the Silence program has been in place since 1997 and is mandatory for elders, ministers, and those working with children or vulnerable people. BTS is more developed than many people realise. Foundations training runs on a three-year cycle with annual Read and Review refreshers for those working with children. The CPU conducts compliance audits. The online training requires an 80% quiz score for satisfactory completion. This is real work done by serious people.
The harder question is not whether BTS exists, but what public evidence is available that it changes reporting behaviour and outcomes over time. The quiz is a knowledge check — it tests whether you know what to do, not whether you can do it. It does not assess the ability to actually handle a disclosure, manage a conflict of interest, or respond to a vulnerable person in distress. In 28 years, no public independent evaluation of BTS effectiveness or published outcome data has been identified. The system may work well — the training, audit, and complaint-handling machinery is real. But no public independent outcome evaluation has been identified showing whether BTS improves reporting behaviour and outcomes over time, and that gap matters if we are about to remove another layer of structural accountability.
A 2025 peer-reviewed study in Child Abuse & Neglect (Hunt, Higgins & Willis) interviewed 20 Christian leaders across denominations about safeguarding training and found a compliance-focused mentality rather than genuine culture change, with cultural resistance from some leaders who see safeguarding as unnecessary external regulation.
What our own denomination’s data says: The PCA’s 2020 Women’s Ministry Committee survey asked questions directly relevant to whether people feel safe and supported. More than 50% of respondents could not affirm that women who experience domestic violence know who to talk to about it within the church. More than 50% could not affirm that women trust the elders with issues specifically related to women. There were statistically significant differences between how female congregation members and male ministers answered the question of whether women trust elders with these issues. If that gap exists now — in a denomination where women can still serve as elders — what happens when that possibility is removed?
Questions worth sitting with
This is a baseline question. Answer it for how things are now, not how they might change. The honesty of this answer matters because it tells the Assembly what the starting point is.
If a woman in your church experienced harm from a male leader — bullying, harassment, spiritual abuse — where would she go? Who would she trust? Is that pathway clear, accessible, and safe?
Has your session ever discussed what would happen if someone made a complaint? Not in theory. Practically. Who handles it? What process is followed?
Do you know whether your elders and ministry workers have completed Breaking the Silence training? Could you find out?
Survey Q3: “Do you believe the proposed change could influence the amount of conflict, bullying, or harassment?”
Evidence: Bullying and harassment in organisations are not primarily caused by bad individuals. Professor Michelle Tuckey’s research at the University of Adelaide has shown that the majority traces to how an organisation is structured — its governance, its reporting lines, who holds authority, and who can challenge it. In plain terms: when your church changes who sits on session, who gets a vote, and who has formal standing, it is changing the very structures that research shows either prevent or produce harm. Tuckey’s BRIDGES at Work framework identifies formal organising arrangements as one of four subsystems that determine whether an organisation is safe. The proposed change directly alters those arrangements in every congregation.
What this looks like in churches: Research consistently shows that women in church settings already experience higher rates of burnout and lower confidence that leadership will respond to their concerns. Emerging evidence suggests women may be disproportionately affected when the organisational climate around psychological safety is poor (Amoadu et al., 2024). If the conditions are already difficult for women in churches, a governance change that reduces their formal voice does not make those conditions better.
The experience of comparable churches: These are not distant examples. The Royal Commission found that in some religious institutions — institutions led by people who believed they were serving God faithfully — the absence of women in governance negatively affected accountability in child-safety contexts. The Southern Baptist Convention’s Guidepost report (2022) documented over 700 accused abusers under exclusively male governance. The Presbyterian Church in America’s 2023 General Assembly — an all-male body — voted down all four abuse-prevention proposals put before it. The question is not whether our people would do the same. It is what structures we have in place to make sure they don’t have to.
Questions worth sitting with
Has your church experienced conflict, bullying, or harassment? If so, how was it handled? Would the proposed change make that handling better or worse?
The Reformed tradition teaches total depravity — not that every person is as bad as they could be, but that sin touches every part of human life, including the exercise of authority. We build structures — plurality of elders, higher courts, the right of appeal — because we know that good intentions are not a sufficient safeguard. Does this change add accountability or remove it?
The Royal Commission heard from more than 4,000 survivors of abuse in religious institutions. Its findings about male-only governance are not theoretical. They are based on evidence from institutions that believed they were serving God faithfully. How should those findings inform your answer?
Think about specific scenarios: a woman experiencing domestic violence, a child safety concern raised by a female volunteer, a complaint about a male leader’s conduct. Under the proposed arrangement, who hears these? Who decides what happens next?
Survey Q4: “Do you believe that this could impact the willingness of staff or volunteers to report issues such as conflict, bullying, or harassment?”
Evidence: The Presbyterian Church in America’s Ad Interim Study Committee on Domestic Abuse and Sexual Assault (2022) documented significant concerns about women’s ability to trust male-only elder bodies with issues like domestic violence and sexism. Advisory member Ann Maree Goudzwaard noted that substantial work remains before women in the American PCA can have assurance their case will be shepherded well. Murray Capill, Dean of Ministry Development at the Reformed Theological College in Melbourne, acknowledged that “from my experience and observation, lots of us don’t do that well” at consulting women — and this in a denomination that already has male-only eldership.
Governance research: There is a reasonable concern, supported by governance and institutional abuse research, that when the people receiving complaints share the demographic profile of the people being complained about, reporting confidence is affected. When there is no one on the governance body who shares the complainant’s experience, trust may diminish. The PCA (Australia)’s own 2020 survey data — where more than half of women could not affirm trust in elders with women-specific issues — is consistent with this concern.
Questions worth sitting with
If you needed to report a concern about a male leader’s conduct, would you feel comfortable bringing it to an all-male session? Would the women in your church?
Is there currently a woman in your church’s governance who would be a first point of contact for women with concerns? If the proposed change proceeds, who fills that role? What standing would they have?
Craig Tucker’s paper notes that dealing with complaints by women against male leaders is “particularly problematic when the case is heard by a group in which only men get a vote.” Do you agree? Does your church have any mechanism to address this problem?
The survey itself directs people who are feeling impacted to “reach out to your Minister, Elders or leadership.” For some respondents, these are the very people they find it hardest to approach. Does your church have an alternative pathway?
Section 4: Support and Recommendations
Survey Q1: “What additional support or resources do you believe are needed to cultivate a safe and healthy environment if the proposed change is made?”
This is an open question, and it is your opportunity to name specific, concrete measures rather than general sentiments. The evidence suggests several areas that would need to be addressed if the change proceeds. You may find these helpful as starting points for your own answer:
From advisory models to demonstrated safeguards. Alternative models for women’s participation have been proposed and in some cases Assembly-endorsed. The GANSW 2022 resolution encouraged sessions to establish women’s advisory groups, appoint women to “co-worker” positions with associate-style privileges, include women in complaints and disciplinary processes, and appointed a Women’s Engagement Working Group. The national WMPCA deliverances (2023 GAA) went further with similar recommendations at presbytery and Assembly level. This is real work. But the mechanisms are advisory (“encourage each Session to consider”), not binding. Uptake has been slow — a growing number of churches have established women’s advisory groups, but implementation is uneven. No binding, denomination-wide, independently evaluated mechanism has been demonstrated as an effective safeguard alongside this proposed change. The model on offer is partial, and its effectiveness has not been tested. The experience of other denominations suggests that advisory and associate-style mechanisms, without formal standing, often do not deliver the voice they promise.
Independent evaluation of safeguarding training. BTS has an 80% knowledge quiz and the CPU conducts compliance audits — but after 28 years, the denomination should also be able to demonstrate publicly that its training changes behaviour and outcomes. Competency-based assessment (testing the ability to handle a disclosure, not just know about one), independent external evaluation, and published outcome data would allow the denomination to know — in a measurable sense — whether its safeguarding system works.
An adequately independent complaints pathway. The PCNSW does have a Conduct Protocol Unit and a contact-person pathway intended to provide independence from the local church. The live question is whether that pathway is sufficiently independent from the denomination itself, sufficiently visible to women and volunteers, and sufficiently trusted — particularly for complaints about male leaders heard by all-male bodies.
Measurable implementation of Healthy Complementarianism. The GANSW 2022 resolution encouraged sessions to establish women’s advisory groups, appoint women to “co-worker” positions, and include women in complaints and disciplinary processes, and appointed a Women’s Engagement Working Group to bring further recommendations. At the national level, the PCA’s Women’s Ministry Committee presented a detailed strategy to the 2023 GAA with 15 deliverances along similar lines — relevant context, though national deliverances do not bind the PCNSW. Dr Murray Smith, one of the architects of the overture, has acknowledged that if the church says ‘no’ to women in eldership without also saying ‘yes’ to all the ways men and women complement each other, it has only done half the job. Whether the GANSW’s own 2022 encouragements have been implemented at session level is the question the NSW Assembly should be able to answer before voting on the overture.
Learning from complementarian denominations that have tried. The Presbyterian Church in America’s DASA report (2022) is the most comprehensive safeguarding framework produced by any complementarian denomination for all-male governance. Its 220 pages cover abuse response, whistleblower protections, and mandatory background checks. But the report is non-binding, and efforts to implement even its background check recommendations have met resistance. Nowhere in the Reformed world has a denomination produced binding structural safeguards specifically designed for all-male governance. The gap between complementarian theology — which teaches that authority should protect the vulnerable — and complementarian institutional design remains vast.
Questions worth sitting with
Which of these measures does your church currently have in place? Which are absent?
Should any of these be required before the change takes effect, or are you comfortable with them being developed after?
Even if your church does not currently have women elders, the proposed change closes the door for every congregation, including those where women’s gifts in governance might be most needed — small country parishes, churches in vacancy, congregations where the work of eldership falls on very few shoulders. What support would those churches need?
The Reformers insisted on building structures before granting authority. Knox and Calvin did not consolidate power and then design accountability later. They built the accountability first. What does that principle suggest about the sequencing of this change?
Survey Q2: “Do you have any recommendations for your local church and denominational leaders regarding health and safety as it relates to diversity and inclusion?”
Questions worth sitting with
What would you need to see from your session and presbytery to have confidence that this change — if it proceeds — will not diminish the safety and wellbeing of women in your church?
Are there practical steps your church could take now, regardless of the outcome of this vote, to strengthen the voice and safety of women in your community?
What would “doing this well” look like, even if you support the change? What would “doing this badly” look like?
If your church has never had women elders, have you considered why? It may be because your session holds a theological conviction that eldership should be male-only. It may be because the congregation discerned it was not right for your context at this time. Or it may be because the question was never asked. Each of these is a different starting point, and the proposed change affects each differently — the first formalises an existing conviction, the second removes a future option, and the third closes a door that was never opened.
Section 5: Final Comments
The survey gives you an open field. If the structured questions did not capture what you most want to say, say it here. Some things worth considering:
Questions worth sitting with
Is there something about this issue that keeps you awake at night? Something you haven’t been able to say to anyone in your church? This is the place for it.
If you are uncertain about the theological question, you are allowed to say so. The survey is not asking you to resolve the exegesis. It is asking what happens to real people in real churches if this change is made.
If you support the change but have concerns about how it is being implemented, this is the place to name those concerns. Supporting a theological position and questioning the process are not contradictory.
If your church has never had women elders, the proposed change might feel like it changes nothing for you. But consider: is there a difference between a church that has chosen its current arrangement and a church that has had the choice removed? What does that difference mean for the kind of denomination we want to be?
If you have experienced harm in a church setting — from male authority exercised without accountability, from complaints that went nowhere, from being told your concerns didn’t matter — your experience is evidence. It matters. You are not required to share it, but if you choose to, it will inform the Assembly’s understanding of what is at stake.
Section 6: Contact and Demographic Information
The survey’s contact details (name and email) are optional. However, the demographic and identifying questions earlier in the survey — your church name, your role, your gender, whether your church has female elders — are not marked as optional.
The committee has stated that when presenting results to the Assembly, it will aggregate the data so that no individual responses can be identified by anyone outside the survey process, including by the Assembly itself.
It is still worth being aware, however, that in smaller congregations a combination of church name, role, and gender may be enough to make a respondent identifiable within the survey data itself — even without their name attached. If the survey is asking whether you trust the current governance structures to keep you safe, and your answers could be connected to you by those with access to the raw data, that may affect how freely you respond. This is not a reason not to complete the survey — your voice matters. But it is worth knowing.
The survey notes that “certain State or Federal legislation may require the denomination to share these details.” This is a standard data-handling disclosure. If you have concerns about confidentiality, you may choose not to provide your name or email while still completing the substantive sections.
If you are feeling particularly impacted by this issue, Jericho Road provides access to counselling services. You do not need to go through your minister or elders to access this.
A note on the consultation process
The Assembly has charged sessions with responsibility for engaging in this consultation properly. That charge should be taken seriously. If your session has not discussed this survey with your congregation, has not made time for conversation about its questions, or has not explained how your responses will be used, you may wish to raise that directly.
Consultation that meets the legal and moral standard our denomination has set for itself requires more than distributing a link. It requires making space for people to be heard, especially those who may find it hardest to speak. In some churches, that will mean actively reaching out to women, to newer members, to those who serve faithfully but have never been asked what they think about how the church is governed.
Adams, R.B. & Ferreira, D., “Women in the Boardroom and Their Impact on Governance and Performance,” Journal of Financial Economics 94, no. 2 (2009): 291–309. doi.org
Allen, J.A. & Mueller, S.L., “The revolving door: A closer look at major factors in volunteers’ intention to quit,” Journal of Community Psychology 41, no. 2 (2013): 139–155. doi.org
Amoadu, M., Ansah, E.W. & Sarfo, J.O., “Preventing workplace mistreatment and improving workers’ mental health: A scoping review of the impact of psychosocial safety climate,” BMC Psychology 12, 195 (2024). pmc.ncbi.nlm.nih.gov
Australian Royal Commission into Institutional Responses to Child Sexual Abuse, Final Report, Volume 16: Religious Institutions (2017). childabuseroyalcommission.gov.au
Breaking the Silence, Foundations Training Workbook, 2024 Edition. breakingthesilence.org.au
Buse, K., Bernstein, R.S. & Bilimoria, D., “The Influence of Board Diversity on Nonprofit Governance Practices,” Journal of Business Ethics 133, no. 4 (2016): 179–191.
Catalyst, The Bottom Line: Connecting Corporate Performance and Gender Diversity. catalyst.org
Dollard, M.F. & Bakker, A.B., “Psychosocial safety climate as a precursor to conducive work environments,” Journal of Occupational and Organizational Psychology 83 (2010): 579–599.
Guidepost Solutions, Report of the Independent Investigation: The SBC Executive Committee’s Response to Sexual Abuse Allegations (2022). guidepostsolutions.com
Hunt, G.R., Higgins, D.J. & Willis, M.L., “‘Just tick the box and move on’: Australian Christian religious leaders reflect on safeguarding practices,” Child Abuse & Neglect 167 (2025). doi.org
Janis, I.L., Victims of Groupthink (1972). Houghton Mifflin.
Jensen, J., “The Functional Female Officer Report,” Save the PCA (2025). savethepca.com
Ling, V., 2023 Clergy Wellbeing Research Report. effectiveserving.com.au
Ling, V., “Evidence Base Research Map: Psychosocial Safety, Governance & Gender in the Presbyterian Church of NSW” (2026).
Nebbs, A., Psychosocial hazard management in regional volunteer involving organisations (2022). volunteeringstrategy.org.au
PCA (America) Ad Interim Study Committee on Domestic Abuse and Sexual Assault, DASA Report (2022). pcaga.org
PCNSW Women’s Ministry Committee, Assembly Resolution: Co-Heirs and Co-Workers (GANSW, July 2022). pcnswwomen.org.au
Smith, M. & Wright, F., Overture (xii): From the Special Committee on Elders and Deacons to amend The Code Part II 4.02(c) concerning male only elders (2026).
Tucker, C., Why Should We Stick With The Status Quo And Retain Female Elders? (Feb 2026). Scots Church Sydney.
Tuckey, M.R. et al., “Workplace bullying as an organizational problem,” Journal of Occupational Health Psychology 27, no. 6 (2022): 544–565. doi.org
Tuckey, M.R. et al., BRIDGES at Work (2025). bridgesatwork.au
Work Health and Safety Act 2011 (NSW). legislation.nsw.gov.au
Women’s Ministry Committee of the Presbyterian Church of Australia, On Men and Women in Ministry and Leadership in the PCA: A Report for the 2023 GAA (2023). wmpca.org.au
Yager, A., “Before We Change the Lock, We Should Build the Door,” andrewyager.com (12 April 2026). andrewyager.com
This document is licensed under Creative Commons Attribution 4.0 International (CC BY 4.0). You are free to share and adapt this material for any purpose, provided you give appropriate credit.
The Presbyterian Church of NSW is consulting with its people about whether to remove women from the eldership. If you’re a member of a PCNSW church, you may have been pointed to a <a href="https://www.surveymonkey.com/r/WHSEldershipSurvey2026">WHS survey</a> asking you to assess the impact of the proposed change on wellbeing, culture, and safety. The survey is clear about what it’s doing: “The purpose of the survey is not to assess whether individuals agree or not with the proposed change but to assess the impact if all future elders are male.”
That’s the right question. And it’s a question the Assembly has asked through a Consultative Committee specifically established for this purpose. But outside the survey itself, I haven’t seen much public analysis of the evidence that bears on exactly these questions — the governance research, the Royal Commission findings, the experience of sister denominations, the state of our own safeguarding training. The survey asks whether the change could influence conflict, bullying, or harassment. It asks whether it could affect willingness to report. It asks what additional support would be needed. These are serious questions. They deserve serious answers grounded in evidence, not just feelings.
This post is my attempt to fill that gap — to lay out the evidence that bears on the questions the survey is asking — not to claim the answer is settled, but to argue the risks are serious enough that the Assembly should not proceed on trust alone. Not because the theological arguments don’t matter — they do — but because a church that changes its governance structure without counting the cost isn’t being faithful. It’s being reckless.
I should say plainly: I sit on the status quo side of this debate. I think the current arrangement — male ministers, male and female ruling elders — has served the PCNSW well, and I’m not persuaded that the case for change has been made. But this post isn’t primarily about which side is right on the exegesis. It’s about something I think both sides should care about: the consultation is asking people to assess impacts on safety, wellbeing, and culture, but it provides no evidence, no framework, and no guidance for thinking about those questions. People are being asked to answer a governance and safeguarding question armed with nothing but instinct. That’s not good enough for a decision this consequential.
I’m not an elder. I don’t have standing in the Assembly. I’m a member and lay leader in the PCNSW — someone who sits in the pew, serves where I can, and cares about what happens to the people around me. I’ve spent time working on child safety policy in community organisations, and that work has shaped how I think about governance, authority, and what happens when institutions get the structures wrong.
This post is long. The issue deserves it. I’ll try to lay out both sides of the argument fairly before explaining why I think the proposed change — even if you accept its theological premises — is being pursued in the wrong order and without the safeguards our people need.
Not all the evidence below carries the same weight. Royal Commission findings and Assembly documents are primary sources. Peer-reviewed research is stronger than commentary. I use overseas denominational examples and advocacy material to illustrate implementation risks, not to predict NSW outcomes.
<h3>What’s actually being proposed</h3>The <a href="https://drive.google.com/file/d/1BkHlYWJSkuZf-fgLcUEjDkkxtifIm_CW/view">overture from the Special Committee on Elders and Deacons</a> would insert a single word — “man” — into the PCNSW Code §4.02(c), so that the qualification for eldership reads: “be a mature Christian <strong>man</strong>, who demonstrates exemplary Christ-like character as defined by the Scriptures (1 Tim. 3:1–7; Tit. 1:5–9).”
Women currently serving as elders would be grandfathered. No new women could be ordained. Every church court in the denomination — session, presbytery, and Assembly — would become exclusively male. NSW is the last Australian state that permits women ruling elders. The other states moved to male-only eldership between the late 1990s and early 2000s, so the question isn’t hypothetical. We can look at what happened elsewhere.
<h3>This is not a debate about biblical authority</h3>Both sides need to hear this, because both sides sometimes talk past it.
<a href="https://docs.google.com/document/d/1m8cPjDMMJh74v89cYIchQundkBNCOxc8iYv1Hjr2cyg/edit">Craig Tucker’s paper</a>, endorsed for wide distribution and discussion by more than thirty ministers and elders, puts it plainly: “The current debate in NSW is not a debate between those who believe in the authority of the Bible and those who do not.” Both sides affirm biblical authority. Both affirm complementarianism. Both agree that pastoral ministry should be restricted to men. Both agree that women may speak, argue from Scripture, and persuade at every level of church government.
The entire practical disagreement comes down to whether a woman may cast a vote in a session meeting.
That’s it. One vote in a committee. And yet this question has consumed years of denominational energy, generated thousands of pages of argument, and is now heading for a vote that will permanently reshape how the church governs itself.
<h3>The question everyone should be asking</h3>The exegetical arguments on both sides are well-developed, and I’ll summarise them. But the question that determines everything is one of Presbyterian polity, not Greek vocabulary: <strong>which biblical texts define the ruling elder’s office?</strong>
<h4>The Smith/EDC position</h4>Dr Murray Smith’s <a href="https://drive.google.com/file/d/1fl2zto9xa7TlN9kOFv8JAJxgOmJBCtKV/view">case for male-only eldership</a> argues that PCNSW elders are biblical elders, defined by the overseer passages in 1 Timothy 3 and Titus 1. The PCNSW Code already cites those passages. The Code already describes elders using the specific language of those passages — “shepherds” under “the Chief Shepherd,” exercising “oversight and government,” “competent to teach.” If those texts define the office, and those texts require the officeholder to be male, then the conclusion follows.
This is the overture’s most effective argument, because it turns the Code against the status quo. Code §4.02(c) says elders must demonstrate character “as defined by the Scriptures (1 Tim. 3:1–7; Tit. 1:5–9).” If you’ve already legislated that those texts define your elders, you’ve already accepted the premises. The overture just draws the conclusion.
Smith also argues that 1 Timothy 2:12 — Paul’s prohibition on women “teaching or exercising authority over men” — anticipates the elder qualifications in chapter 3, tying the two passages together. The prohibition is grounded in creation order (1 Tim. 2:13–14), not cultural circumstances, and therefore applies permanently. The complementarian scholarly consensus behind this reading is strong — Andreas Köstenberger’s syntactical analysis of the Greek, Thomas Schreiner’s work on the creation-order grounding, Wayne Grudem’s systematic case, Douglas Moo’s definitive statement that the restrictions are “permanent, authoritative for the church in all times and places.” These aren’t fringe voices. They represent the mainstream of evangelical complementarian scholarship.
The “husband of one wife” requirement (1 Tim. 3:2; Tit. 1:6) adds weight. Paul uses the ordinary Greek words for “man” and “woman,” and applies the female equivalent — “wife of one husband” — to enrolled widows in 1 Timothy 5:9, confirming that the male phrasing is gender-specific rather than generic.
<h4>The Tucker position</h4>Tucker’s response isn’t egalitarian. It’s a polity argument. He contends that the Westminster Assembly in 1645, when it wrote the <a href="https://en.wikipedia.org/wiki/Form_of_Presbyterial_Church-Government">Form of Presbyterial Church Government</a>, deliberately grounded its “Other Church-Governors” (ruling elders) in different texts from its pastors — specifically Romans 12:8 and 1 Corinthians 12:28, which describe a gift of governance, not the overseer passages of 1 Timothy 3 and Titus 1.
George Gillespie, whose work shaped the Westminster Assembly’s thinking on this point, cited Titus 1:6–8 for the ruling elder’s character but deliberately left out verse 9 — the verse about teaching. He wasn’t being careless. He was making a distinction. The vote at the Assembly passed with only two dissenters, neither of whom wanted the overseer texts applied to governors.
Tucker documents that when the 1967 GAA opened eldership to women, the resolution explicitly cited this Westminster distinction: it “holds the doctrine of the Eldership as set forth in the Westminster Form of Presbyterian Church Government, under the heading ‘Other Church Governors'” and on that basis proposed that women could serve. The 1997 GAA confirmed this was a matter of government, not doctrine.
So when the overture is framed as “returning to our roots,” Tucker’s counter is that the roots being invoked aren’t Westminster’s 1645 settlement but Robert Dabney’s 1860 reinterpretation. Iain Murray, the respected Banner of Truth editor, acknowledged this: “My personal opinion is that the [Dabney Model] has often found acceptance among us because we assumed it was the position biblically established by the Westminster Assembly. The truth is that the assumption is wrong.”
There’s also a separate biblical case — not Tucker’s primary argument, but present in the <a href="https://docs.google.com/document/d/1RMSkEVBLgXrgulJRQT52lv01CyQSuNkm/edit">“no change” paper</a> circulating among members. It points to women exercising authority throughout Scripture: Deborah governing as prophet and judge (Judges 4–5), Huldah speaking God’s word to kings (2 Kings 22–23), Phoebe called <em>diakonos</em> — the same word Paul uses for himself (Romans 16:1), Junia called “outstanding among the apostles” (Romans 16:7), women as the first witnesses and proclaimers of the resurrection. The “husband of one wife” language, on this reading, is an idiom about marital fidelity, not gender exclusivity — just as “you shall not covet your neighbour’s wife” (Exodus 20:17) doesn’t condone women coveting their neighbour’s husband. I find some of these arguments more persuasive than others, but they’re made by people who take Scripture seriously, and dismissing them as liberal is lazy.
<h4>The Code catches both sides</h4>Here’s what makes this genuinely hard. The Smith side has a strong textual argument from the Code as currently written. The Tucker side has a strong historical argument about legislative intent — what the Assembly meant when it wrote those words. The eight-year process (2015–2023) that produced the current Code language considered this question and chose not to insert “man.” That’s not an accident.
Both arguments have integrity. The question is what you weigh more: what the text says, or what the people who wrote it intended.
<h3>Even if Smith is right, the order is wrong</h3>This is where I part company with the overture. Not on the exegesis — reasonable people disagree on that, and I respect the scholarship on both sides. But on the sequencing.
There’s a distinction worth naming first: most PCNSW congregations don’t currently have women elders. For many churches, the practical effect of the overture would be small. But there is a difference between a church that has chosen not to elect women elders and a church that is prohibited from doing so. The current Code preserves the freedom of each session to discern, under Scripture and the Spirit, whether women should serve. The proposed change removes that freedom permanently — not just for churches that have women elders now, but for every future congregation that might discern that call differently.
The overture proposes to make all church courts exclusively male without first establishing any concrete mechanism for women’s participation in governance. No alternative has been designed, consulted on, voted on, or tested. The Assembly is being asked to remove an existing structure and trust that something will replace it.
The experience of every other state that’s done this, and every comparable denomination overseas, says the replacement doesn’t come. Or it comes as what Lauren Riske called “shadow elders” in <em>AP Magazine</em> — women invited to attend but not vote, burdened with responsibility but stripped of standing. A two-tier system that satisfies nobody.
In 2020, the <a href="https://www.wmpca.org.au/2023-gaa-paper">Women’s Ministry Committee of the Presbyterian Church of Australia</a> — our own denomination — surveyed 484 people (324 women, 162 men) across the PCA and presented its findings to the 2023 General Assembly of Australia. More than 50% of all respondents — men and women — could not affirm that women trust the elders with issues specifically related to women such as domestic violence and sexism. Less than 50% of women believed they were consulted by elders regarding church direction. More than 30% of women felt limited in what they could do. On multiple questions, the gap between how female congregation members and male ministers perceived the same reality was statistically significant. This is not overseas data. It comes from our own churches, and it paints a picture of a communication gap that ministers and elders need to take seriously — the report’s own words.
Murray Capill, Dean of Ministry Development at the Reformed Theological College in Melbourne, acknowledged that “from my experience and observation, lots of us don’t do that well” at consulting women.
Overseas examples are suggestive, not dispositive. But the patterns are consistent enough to take seriously. The Save the PCA “<a href="https://www.savethepca.com/ffo/">Functional Female Officer Report</a>” (2025) — published by a conservative advocacy group within the American Presbyterian Church in America, but documenting observable practices — found that numerous PCA (America) churches have created quasi-official workarounds: women leading worship elements, sitting in session meetings as “elder advisors,” going through officer training, being commissioned with public vows — all without formal office. Male-only eldership, in practice, generates constant pressure toward informal arrangements that satisfy nobody. The rules say one thing; the practice says another.
The Christian Reformed Church’s experience tells a different story. The CRC opened all offices to women on a local-option basis in 1995, and 36 complementarian churches with approximately 7,500 members left to form the United Reformed Churches in North America. The denomination never recovered those numbers. The lesson cuts both ways: this question is deeply divisive in either direction. But the PCNSW already has 35 years of settled practice. The disruption flows from changing that practice, not from maintaining it.
<h3>The Royal Commission said this, specifically</h3>The <a href="https://www.childabuseroyalcommission.gov.au/final-report">Australian Royal Commission into Institutional Responses to Child Sexual Abuse</a> (2013–2017) examined over 4,000 survivors’ accounts from religious institutions. Volume 16, devoted entirely to religious organisations, found — as a matter of evidence — that the absence or insufficient involvement of women in leadership and governance negatively affected decision-making, accountability, and may have contributed to inadequate institutional responses to child sexual abuse.
That’s not a recommendation. It’s a finding. Based on evidence. From the most significant institutional abuse inquiry in Australian history.
The Commission identified clericalism — the elevation of ordained or ministerial status above laity — as a significant contributing factor in some religious institutions. It found that insufficient involvement of women in leadership and governance negatively affected decision-making and accountability. <a href="https://www.childabuseroyalcommission.gov.au/religious-institutions">Recommendation 16.37</a> called for child-safety advisory mechanisms that include lay men and women with relevant expertise. The Commission received allegations relating to 106 cases in Presbyterian Church institutions across Australia. We are not exempt.
A church that votes to make all its governance bodies exclusively male in 2026 — nine years after that Final Report — cannot claim it didn’t know.
<h3>The governance research raises a serious caution</h3>Tucker’s paper notes that “it is acknowledged on all sides of the debate that governance bodies comprising men and women working together make better decisions and achieve better outcomes.” The Elders and Deacons Committee’s own 2021 report cited the same evidence. The academic literature is more cautious than either side acknowledges — an ANU review found that most studies do not convincingly isolate a causal effect of board diversity on overall organisational performance. But the evidence for narrower effects — improved monitoring, better attendance, stronger scrutiny, and greater sensitivity to ethical risk — is robust. The most that can responsibly be said is that removing women from governance may carry governance costs, especially around challenge and accountability. The research raises a serious caution. It does not settle the polity question by itself.
Adams and Ferreira’s 2009 study in the <em>Journal of Financial Economics</em> found that female directors have better attendance records, that male directors attend more reliably when boards are gender-diverse, that women are more likely to join monitoring committees, and that CEO accountability increases measurably. An important nuance: they also found that the effect depends on existing governance quality. In well-governed organisations with strong accountability already in place, adding diversity provides diminishing returns. But in poorly-governed organisations — where scrutiny is weak and challenge is absent — gender diversity adds significant value. The question for the PCNSW is whether all our sessions are in the well-governed category. The denomination has never measured this.
Irving Janis’s foundational work on groupthink identified homogeneity of membership as a key antecedent condition — the very thing that makes groups stop questioning themselves. Session meetings are governance meetings. They make decisions about people’s lives — pastoral care, discipline, complaints, child safety.
The proponents of male-only eldership would argue, fairly, that if Scripture requires it, then pragmatic research can’t override a clear biblical command. I agree with the principle. But it loops back to the polity question: if the Westminster model is correct and ruling elders are grounded in different texts, then there <em>is no</em> clear biblical command to override the research. The governance evidence becomes a relevant consideration in an area where Scripture gives the church freedom to order its government wisely.
<h3>Compliance is not comprehension</h3>Someone will point out that we have Breaking the Silence. They’re right. <a href="https://breakingthesilence.org.au/">BTS</a> has been in place since 1997 — one of the earliest church safeguarding programs in Australia. It’s mandatory for elders, ministers, and anyone working with children or vulnerable people. It covers child protection, domestic violence, abuse of authority, mandatory reporting. It’s been updated regularly, expanded to cover DFV in 2020, and mapped to the Royal Commission’s Child Safe Standards. The Conduct Protocol Unit is professionally staffed. This is real work done by serious people.
BTS is more developed than many people realise. Foundations training runs on a three-year cycle with annual Read and Review refreshers for those working directly with children and young people. The CPU conducts compliance audits across churches. The online training requires an 80% quiz score for satisfactory completion, and reattempts are permitted. An independent contact person option exists for reporting abuse via the CPU, providing a pathway that does not run through the local session. This is a real and serious safeguarding architecture.
The harder question is not whether BTS exists, but what public evidence is available that it changes reporting behaviour and outcomes over time. The quiz tests knowledge — whether you know the right answer — but not competency: whether you can handle an actual disclosure, manage a conflict of interest, or respond appropriately to a vulnerable person in distress. After 28 years of operation, I have not found a public independent evaluation of BTS effectiveness or published outcome data that would let anyone assess whether the training changes behaviour. The system may well work. But we can’t currently demonstrate that it does, and that gap matters more if we’re about to remove another layer of structural accountability.
A 2025 peer-reviewed study in <em>Child Abuse & Neglect</em> (Hunt, Higgins & Willis, Australian Catholic University) interviewed 20 Christian leaders across denominations about safeguarding training and titled their findings “Just tick the box and move on.” It’s a small qualitative study — not a definitive evaluation of any one program, and not specific to BTS — but the patterns it identified are worth taking seriously: compliance-focused mentality rather than genuine culture change, cultural resistance from leaders who see safeguarding as unnecessary external regulation, and outright hostility from some volunteers. Those patterns describe a risk that any church safeguarding program needs to guard against.
The Anglican Diocese of Sydney enforces a hard rule: if your safeguarding training lapses and isn’t renewed within 30 days, you step down from ministry entirely. The Catholic Church runs formal external audits against national safeguarding standards with registered auditors. We have neither enforcement mechanism.
The WHS survey asks directly: “Do you believe that this could impact the willingness of staff or volunteers to report issues such as conflict, bullying, or harassment?” There is a reasonable concern, supported by governance and institutional abuse research, that when the people receiving complaints share the demographic profile of the people being complained about, reporting confidence is affected. Our own denomination’s 2020 WMPCA survey is consistent with this concern — even men in the PCA couldn’t affirm that women trust elders with issues like domestic violence. That wasn’t just a women’s perception. It was a denomination-wide finding. We don’t need to guess what the risks might be. We have data from our own churches.
<h3>Genesis 3:16 — the text complementarians should take most seriously</h3>This argument doesn’t require an egalitarian theology. It requires taking Genesis 3 seriously within the complementarian framework.
“Your desire shall be contrary to your husband, and he shall rule over you.” Both sides agree this describes a distortion — what sin does to the relationship between men and women, not what God designed.
John Piper, co-founder of the Council on Biblical Manhood and Womanhood, calls this a description of the curse. He identifies what he calls “the essence of corrupted maleness” as the self-aggrandising effort to subdue and control and exploit women. Calvin read it as introducing a harsher subjection than existed before the Fall. Susan Foh, whose 1975 interpretation in the <em>Westminster Theological Journal</em> most complementarians follow, concluded that the rule of love founded in paradise is replaced by struggle, tyranny, and domination.
Even CBMW’s Jonathan Leeman conceded that views of male headship can contribute to partner violence. Al Mohler said complementarian theology “can be, and it sometimes is” used to abuse women.
So here’s the question we need to sit with: if you believe Genesis 3:16 predicts that male authority, post-Fall, will tend toward sinful domination — and you do believe that, it’s in your own literature — then what governance structures guard against that predictable distortion? The answer can’t be “godly men will lead like Christ.” That’s the aspiration. The question is what happens when they don’t. Because sometimes they don’t. The SBC’s 700-name database of accused abusers says they don’t. The Presbyterian Church in America’s 2023 General Assembly, where an all-male body voted down all four abuse-prevention proposals, says they don’t.
Removing women from governance concentrates authority in exactly the group most susceptible to the corruption Genesis 3 describes, while eliminating the structural accountability that diverse governance provides. That should bother every complementarian in the room.
<h3>What would need to be true before this change could responsibly be made</h3>I’m not saying “never.” I’m saying “not yet.” And I’m saying the burden of proof is on those proposing the change to show that these conditions are met:
<strong>An alternative mechanism for women’s participation in governance has been designed, consulted on, and voted on by the Assembly — and published before the vote on the overture, not promised afterward.</strong> Not promised. Not recommended. Voted on, with specifics. What is it? How does it work? What standing do women have? Can they merely advise, or do they have constitutive power? These aren’t details to work out later. They’re the whole question.
<strong>Breaking the Silence has been independently evaluated for effectiveness.</strong> BTS has an 80% knowledge quiz, compliance audits, and a contact-person pathway — but after 28 years, the denomination should also be able to demonstrate publicly that its training changes behaviour and outcomes. Competency-based assessment, independent external evaluation, and published outcome data would allow the denomination to know — in a measurable sense — whether its safeguarding system works. If it does, prove it. If it doesn’t, fix it before you remove the structural safeguard that partly compensates for the gap.
<strong>An independent complaints mechanism, accessible to women and not solely administered by male office-bearers, has been established and resourced.</strong> The CPU and contact-person pathway already provide some independence from the local session. The live question is whether that pathway is sufficiently independent from the denomination itself, sufficiently visible to women and volunteers, and sufficiently trusted — particularly for complaints about male leaders heard by all-male bodies. Dealing with complaints by women against male leaders is, as Tucker’s paper notes, “particularly problematic when the case is heard by a group in which only men get a vote.”
<strong>The church has mapped which pathway handles which class of concern.</strong> The WHS survey asks about conflict, bullying, harassment, and willingness to report. BTS is designed for abuse, abuse of authority, and child safety — and it does that work seriously. But not every concern the survey names is a BTS matter. If the Assembly is asking members to assess broader risks, it should be able to show where a complaint about session conflict goes, where a bullying concern about a ministry leader is heard, and what independent options exist in each case. These pathways need to exist, be published, and be known — not assumed.
<strong>The Healthy Complementarianism resolutions adopted by the GAA have been implemented at session level across PCNSW presbyteries, with measurable outcomes reported annually to the Assembly.</strong> Dr Smith’s own framing: if we say ‘no’ to women in eldership without also saying ‘yes’ to all the ways men and women complement each other, we’ve only done half the job. <em>The Other Cheek</em> observed that the Healthy Complementarianism paper “is clear on how to change the church code to bring about male elders, but there are no worked-out solutions for the ideas to support women.” The ‘yes’ doesn’t exist yet. Build it before you legislate the ‘no.’
<h3>The deeper irony</h3>Presbyterian governance was designed, from its inception, to prevent the concentration of power. Calvin insisted on lay participation alongside clergy. Knox built Scottish polity around elder parity and appeals to higher courts. The whole system exists because the Reformers understood — on the basis of total depravity — that unchecked authority corrupts.
The ruling elder exists as a representative of the congregation. Not an extension of the minister’s authority. A representative. If ruling elders represent the congregation, and the congregation includes women, then a governance body that systematically excludes women has a representational deficit built into its foundation.
Paul’s argument in 1 Corinthians 12 insists that the body cannot say to any member, “I have no need of you.” The ear cannot be told it has no place because it is not an eye.
The proposed change tells more than half the body of Christ that their voice has no formal standing in the governance of their own church. Not because they lack character, competence, or calling. Because of their sex. That should trouble us — not because complementarianism is wrong, but because this particular expression of it contradicts the very logic of the governance system we claim to value.
The evidence in this post has limits. Some of it is institutional and direct; some is comparative and indirect; some comes from advocacy sources I’ve used only as illustration. I’m not claiming the future can be read off a single dataset. I’m arguing that the risks are credible enough, and the safeguards underdeveloped enough, that the Assembly should not make this change on trust.
<h3>Where I land</h3>I don’t think the exegetical question is as settled as either side claims. The meaning of <em>authentein</em> in 1 Timothy 2:12 is genuinely contested among serious scholars. The relationship between the overseer passages and the ruling elder’s office has been debated within Presbyterianism for centuries. Whether 1 Timothy 5:17’s <em>malista</em> identifies a subgroup of teaching elders or refers to all elders is unresolved. These are hard questions with honest disagreement on all sides.
But even if I’m wrong about the exegesis, I’m not wrong about the sequencing. You don’t remove a structural safeguard before you’ve built the replacement. You don’t tell women they’ll be looked after by the same system that, in denomination after denomination, has demonstrably failed to look after them. You don’t vote to concentrate authority in the hands of one sex while knowing what the Royal Commission found about what that does.
If we must change the lock, let’s first build the door. The people who will bear the cost of getting this wrong are the ones with the least power to do anything about it.
I’ve talked to women in our churches who accept complementarianism and still feel afraid of this change. Not because they think it’s theologically wrong — they’re genuinely uncertain about that — but because they’ve seen what happens when institutions consolidate male authority without structural accountability. Some of them have experienced it personally. When they hear “godly men will lead like Christ,” they hear a promise that has been broken before. Not by bad theology, but by bad men operating within good theology. And they wonder who will notice, and who will act, when the governance table has no one who shares their experience sitting at it.
That’s not an argument against complementarianism. It’s an argument for taking it seriously enough to do it carefully.
The Presbyterian Church of NSW is consulting with its people about whether to remove women from the eldership. If you’re a member of a PCNSW church, you may have been pointed to a WHS survey asking you to assess the impact of the proposed change on wellbeing, culture, and safety. The survey is clear about what it’s doing: “The purpose of the survey is not to assess whether individuals agree or not with the proposed change but to assess the impact if all future elders are male.”
That’s the right question. And it’s a question the Assembly has asked through a Consultative Committee specifically established for this purpose. But outside the survey itself, I haven’t seen much public analysis of the evidence that bears on exactly these questions — the governance research, the Royal Commission findings, the experience of sister denominations, the state of our own safeguarding training. The survey asks whether the change could influence conflict, bullying, or harassment. It asks whether it could affect willingness to report. It asks what additional support would be needed. These are serious questions. They deserve serious answers grounded in evidence, not just feelings.
This post is my attempt to fill that gap — to lay out the evidence that bears on the questions the survey is asking — not to claim the answer is settled, but to argue the risks are serious enough that the Assembly should not proceed on trust alone. Not because the theological arguments don’t matter — they do — but because a church that changes its governance structure without counting the cost isn’t being faithful. It’s being reckless.
I should say plainly: I sit on the status quo side of this debate. I think the current arrangement — male ministers, male and female ruling elders — has served the PCNSW well, and I’m not persuaded that the case for change has been made. But this post isn’t primarily about which side is right on the exegesis. It’s about something I think both sides should care about: the consultation is asking people to assess impacts on safety, wellbeing, and culture, but it provides no evidence, no framework, and no guidance for thinking about those questions. People are being asked to answer a governance and safeguarding question armed with nothing but instinct. That’s not good enough for a decision this consequential.
I’m not an elder. I don’t have standing in the Assembly. I’m a member and lay leader in the PCNSW — someone who sits in the pew, serves where I can, and cares about what happens to the people around me. I’ve spent time working on child safety policy in community organisations, and that work has shaped how I think about governance, authority, and what happens when institutions get the structures wrong.
This post is long. The issue deserves it. I’ll try to lay out both sides of the argument fairly before explaining why I think the proposed change — even if you accept its theological premises — is being pursued in the wrong order and without the safeguards our people need.
Not all the evidence below carries the same weight. Royal Commission findings and Assembly documents are primary sources. Peer-reviewed research is stronger than commentary. I use overseas denominational examples and advocacy material to illustrate implementation risks, not to predict NSW outcomes.
What’s actually being proposed
The overture from the Special Committee on Elders and Deacons would insert a single word — “man” — into the PCNSW Code §4.02(c), so that the qualification for eldership reads: “be a mature Christian man, who demonstrates exemplary Christ-like character as defined by the Scriptures (1 Tim. 3:1–7; Tit. 1:5–9).”
Women currently serving as elders would be grandfathered. No new women could be ordained. Every church court in the denomination — session, presbytery, and Assembly — would become exclusively male. NSW is the last Australian state that permits women ruling elders. The other states moved to male-only eldership between the late 1990s and early 2000s, so the question isn’t hypothetical. We can look at what happened elsewhere.
This is not a debate about biblical authority
Both sides need to hear this, because both sides sometimes talk past it.
Craig Tucker’s paper, endorsed for wide distribution and discussion by more than thirty ministers and elders, puts it plainly: “The current debate in NSW is not a debate between those who believe in the authority of the Bible and those who do not.” Both sides affirm biblical authority. Both affirm complementarianism. Both agree that pastoral ministry should be restricted to men. Both agree that women may speak, argue from Scripture, and persuade at every level of church government.
The entire practical disagreement comes down to whether a woman may cast a vote in a session meeting.
That’s it. One vote in a committee. And yet this question has consumed years of denominational energy, generated thousands of pages of argument, and is now heading for a vote that will permanently reshape how the church governs itself.
The question everyone should be asking
The exegetical arguments on both sides are well-developed, and I’ll summarise them. But the question that determines everything is one of Presbyterian polity, not Greek vocabulary: which biblical texts define the ruling elder’s office?
The Smith/EDC position
Dr Murray Smith’s case for male-only eldership argues that PCNSW elders are biblical elders, defined by the overseer passages in 1 Timothy 3 and Titus 1. The PCNSW Code already cites those passages. The Code already describes elders using the specific language of those passages — “shepherds” under “the Chief Shepherd,” exercising “oversight and government,” “competent to teach.” If those texts define the office, and those texts require the officeholder to be male, then the conclusion follows.
This is the overture’s most effective argument, because it turns the Code against the status quo. Code §4.02(c) says elders must demonstrate character “as defined by the Scriptures (1 Tim. 3:1–7; Tit. 1:5–9).” If you’ve already legislated that those texts define your elders, you’ve already accepted the premises. The overture just draws the conclusion.
Smith also argues that 1 Timothy 2:12 — Paul’s prohibition on women “teaching or exercising authority over men” — anticipates the elder qualifications in chapter 3, tying the two passages together. The prohibition is grounded in creation order (1 Tim. 2:13–14), not cultural circumstances, and therefore applies permanently. The complementarian scholarly consensus behind this reading is strong — Andreas Köstenberger’s syntactical analysis of the Greek, Thomas Schreiner’s work on the creation-order grounding, Wayne Grudem’s systematic case, Douglas Moo’s definitive statement that the restrictions are “permanent, authoritative for the church in all times and places.” These aren’t fringe voices. They represent the mainstream of evangelical complementarian scholarship.
The “husband of one wife” requirement (1 Tim. 3:2; Tit. 1:6) adds weight. Paul uses the ordinary Greek words for “man” and “woman,” and applies the female equivalent — “wife of one husband” — to enrolled widows in 1 Timothy 5:9, confirming that the male phrasing is gender-specific rather than generic.
The Tucker position
Tucker’s response isn’t egalitarian. It’s a polity argument. He contends that the Westminster Assembly in 1645, when it wrote the Form of Presbyterial Church Government, deliberately grounded its “Other Church-Governors” (ruling elders) in different texts from its pastors — specifically Romans 12:8 and 1 Corinthians 12:28, which describe a gift of governance, not the overseer passages of 1 Timothy 3 and Titus 1.
George Gillespie, whose work shaped the Westminster Assembly’s thinking on this point, cited Titus 1:6–8 for the ruling elder’s character but deliberately left out verse 9 — the verse about teaching. He wasn’t being careless. He was making a distinction. The vote at the Assembly passed with only two dissenters, neither of whom wanted the overseer texts applied to governors.
Tucker documents that when the 1967 GAA opened eldership to women, the resolution explicitly cited this Westminster distinction: it “holds the doctrine of the Eldership as set forth in the Westminster Form of Presbyterian Church Government, under the heading ‘Other Church Governors'” and on that basis proposed that women could serve. The 1997 GAA confirmed this was a matter of government, not doctrine.
So when the overture is framed as “returning to our roots,” Tucker’s counter is that the roots being invoked aren’t Westminster’s 1645 settlement but Robert Dabney’s 1860 reinterpretation. Iain Murray, the respected Banner of Truth editor, acknowledged this: “My personal opinion is that the [Dabney Model] has often found acceptance among us because we assumed it was the position biblically established by the Westminster Assembly. The truth is that the assumption is wrong.”
There’s also a separate biblical case — not Tucker’s primary argument, but present in the “no change” paper circulating among members. It points to women exercising authority throughout Scripture: Deborah governing as prophet and judge (Judges 4–5), Huldah speaking God’s word to kings (2 Kings 22–23), Phoebe called diakonos — the same word Paul uses for himself (Romans 16:1), Junia called “outstanding among the apostles” (Romans 16:7), women as the first witnesses and proclaimers of the resurrection. The “husband of one wife” language, on this reading, is an idiom about marital fidelity, not gender exclusivity — just as “you shall not covet your neighbour’s wife” (Exodus 20:17) doesn’t condone women coveting their neighbour’s husband. I find some of these arguments more persuasive than others, but they’re made by people who take Scripture seriously, and dismissing them as liberal is lazy.
The Code catches both sides
Here’s what makes this genuinely hard. The Smith side has a strong textual argument from the Code as currently written. The Tucker side has a strong historical argument about legislative intent — what the Assembly meant when it wrote those words. The eight-year process (2015–2023) that produced the current Code language considered this question and chose not to insert “man.” That’s not an accident.
Both arguments have integrity. The question is what you weigh more: what the text says, or what the people who wrote it intended.
Even if Smith is right, the order is wrong
This is where I part company with the overture. Not on the exegesis — reasonable people disagree on that, and I respect the scholarship on both sides. But on the sequencing.
There’s a distinction worth naming first: most PCNSW congregations don’t currently have women elders. For many churches, the practical effect of the overture would be small. But there is a difference between a church that has chosen not to elect women elders and a church that is prohibited from doing so. The current Code preserves the freedom of each session to discern, under Scripture and the Spirit, whether women should serve. The proposed change removes that freedom permanently — not just for churches that have women elders now, but for every future congregation that might discern that call differently.
The overture proposes to make all church courts exclusively male without first establishing any concrete mechanism for women’s participation in governance. No alternative has been designed, consulted on, voted on, or tested. The Assembly is being asked to remove an existing structure and trust that something will replace it.
The experience of every other state that’s done this, and every comparable denomination overseas, says the replacement doesn’t come. Or it comes as what Lauren Riske called “shadow elders” in AP Magazine — women invited to attend but not vote, burdened with responsibility but stripped of standing. A two-tier system that satisfies nobody.
In 2020, the Women’s Ministry Committee of the Presbyterian Church of Australia — our own denomination — surveyed 484 people (324 women, 162 men) across the PCA and presented its findings to the 2023 General Assembly of Australia. More than 50% of all respondents — men and women — could not affirm that women trust the elders with issues specifically related to women such as domestic violence and sexism. Less than 50% of women believed they were consulted by elders regarding church direction. More than 30% of women felt limited in what they could do. On multiple questions, the gap between how female congregation members and male ministers perceived the same reality was statistically significant. This is not overseas data. It comes from our own churches, and it paints a picture of a communication gap that ministers and elders need to take seriously — the report’s own words.
Murray Capill, Dean of Ministry Development at the Reformed Theological College in Melbourne, acknowledged that “from my experience and observation, lots of us don’t do that well” at consulting women.
Overseas examples are suggestive, not dispositive. But the patterns are consistent enough to take seriously. The Save the PCA “Functional Female Officer Report” (2025) — published by a conservative advocacy group within the American Presbyterian Church in America, but documenting observable practices — found that numerous PCA (America) churches have created quasi-official workarounds: women leading worship elements, sitting in session meetings as “elder advisors,” going through officer training, being commissioned with public vows — all without formal office. Male-only eldership, in practice, generates constant pressure toward informal arrangements that satisfy nobody. The rules say one thing; the practice says another.
The Christian Reformed Church’s experience tells a different story. The CRC opened all offices to women on a local-option basis in 1995, and 36 complementarian churches with approximately 7,500 members left to form the United Reformed Churches in North America. The denomination never recovered those numbers. The lesson cuts both ways: this question is deeply divisive in either direction. But the PCNSW already has 35 years of settled practice. The disruption flows from changing that practice, not from maintaining it.
The Royal Commission said this, specifically
The Australian Royal Commission into Institutional Responses to Child Sexual Abuse (2013–2017) examined over 4,000 survivors’ accounts from religious institutions. Volume 16, devoted entirely to religious organisations, found — as a matter of evidence — that the absence or insufficient involvement of women in leadership and governance negatively affected decision-making, accountability, and may have contributed to inadequate institutional responses to child sexual abuse.
That’s not a recommendation. It’s a finding. Based on evidence. From the most significant institutional abuse inquiry in Australian history.
The Commission identified clericalism — the elevation of ordained or ministerial status above laity — as a significant contributing factor in some religious institutions. It found that insufficient involvement of women in leadership and governance negatively affected decision-making and accountability. Recommendation 16.37 called for child-safety advisory mechanisms that include lay men and women with relevant expertise. The Commission received allegations relating to 106 cases in Presbyterian Church institutions across Australia. We are not exempt.
A church that votes to make all its governance bodies exclusively male in 2026 — nine years after that Final Report — cannot claim it didn’t know.
The governance research raises a serious caution
Tucker’s paper notes that “it is acknowledged on all sides of the debate that governance bodies comprising men and women working together make better decisions and achieve better outcomes.” The Elders and Deacons Committee’s own 2021 report cited the same evidence. The academic literature is more cautious than either side acknowledges — an ANU review found that most studies do not convincingly isolate a causal effect of board diversity on overall organisational performance. But the evidence for narrower effects — improved monitoring, better attendance, stronger scrutiny, and greater sensitivity to ethical risk — is robust. The most that can responsibly be said is that removing women from governance may carry governance costs, especially around challenge and accountability. The research raises a serious caution. It does not settle the polity question by itself.
Adams and Ferreira’s 2009 study in the Journal of Financial Economics found that female directors have better attendance records, that male directors attend more reliably when boards are gender-diverse, that women are more likely to join monitoring committees, and that CEO accountability increases measurably. An important nuance: they also found that the effect depends on existing governance quality. In well-governed organisations with strong accountability already in place, adding diversity provides diminishing returns. But in poorly-governed organisations — where scrutiny is weak and challenge is absent — gender diversity adds significant value. The question for the PCNSW is whether all our sessions are in the well-governed category. The denomination has never measured this.
Irving Janis’s foundational work on groupthink identified homogeneity of membership as a key antecedent condition — the very thing that makes groups stop questioning themselves. Session meetings are governance meetings. They make decisions about people’s lives — pastoral care, discipline, complaints, child safety.
The proponents of male-only eldership would argue, fairly, that if Scripture requires it, then pragmatic research can’t override a clear biblical command. I agree with the principle. But it loops back to the polity question: if the Westminster model is correct and ruling elders are grounded in different texts, then there is no clear biblical command to override the research. The governance evidence becomes a relevant consideration in an area where Scripture gives the church freedom to order its government wisely.
Compliance is not comprehension
Someone will point out that we have Breaking the Silence. They’re right. BTS has been in place since 1997 — one of the earliest church safeguarding programs in Australia. It’s mandatory for elders, ministers, and anyone working with children or vulnerable people. It covers child protection, domestic violence, abuse of authority, mandatory reporting. It’s been updated regularly, expanded to cover DFV in 2020, and mapped to the Royal Commission’s Child Safe Standards. The Conduct Protocol Unit is professionally staffed. This is real work done by serious people.
BTS is more developed than many people realise. Foundations training runs on a three-year cycle with annual Read and Review refreshers for those working directly with children and young people. The CPU conducts compliance audits across churches. The online training requires an 80% quiz score for satisfactory completion, and reattempts are permitted. An independent contact person option exists for reporting abuse via the CPU, providing a pathway that does not run through the local session. This is a real and serious safeguarding architecture.
The harder question is not whether BTS exists, but what public evidence is available that it changes reporting behaviour and outcomes over time. The quiz tests knowledge — whether you know the right answer — but not competency: whether you can handle an actual disclosure, manage a conflict of interest, or respond appropriately to a vulnerable person in distress. After 28 years of operation, I have not found a public independent evaluation of BTS effectiveness or published outcome data that would let anyone assess whether the training changes behaviour. The system may well work. But we can’t currently demonstrate that it does, and that gap matters more if we’re about to remove another layer of structural accountability.
A 2025 peer-reviewed study in Child Abuse & Neglect (Hunt, Higgins & Willis, Australian Catholic University) interviewed 20 Christian leaders across denominations about safeguarding training and titled their findings “Just tick the box and move on.” It’s a small qualitative study — not a definitive evaluation of any one program, and not specific to BTS — but the patterns it identified are worth taking seriously: compliance-focused mentality rather than genuine culture change, cultural resistance from leaders who see safeguarding as unnecessary external regulation, and outright hostility from some volunteers. Those patterns describe a risk that any church safeguarding program needs to guard against.
The Anglican Diocese of Sydney enforces a hard rule: if your safeguarding training lapses and isn’t renewed within 30 days, you step down from ministry entirely. The Catholic Church runs formal external audits against national safeguarding standards with registered auditors. We have neither enforcement mechanism.
The WHS survey asks directly: “Do you believe that this could impact the willingness of staff or volunteers to report issues such as conflict, bullying, or harassment?” There is a reasonable concern, supported by governance and institutional abuse research, that when the people receiving complaints share the demographic profile of the people being complained about, reporting confidence is affected. Our own denomination’s 2020 WMPCA survey is consistent with this concern — even men in the PCA couldn’t affirm that women trust elders with issues like domestic violence. That wasn’t just a women’s perception. It was a denomination-wide finding. We don’t need to guess what the risks might be. We have data from our own churches.
Genesis 3:16 — the text complementarians should take most seriously
This argument doesn’t require an egalitarian theology. It requires taking Genesis 3 seriously within the complementarian framework.
“Your desire shall be contrary to your husband, and he shall rule over you.” Both sides agree this describes a distortion — what sin does to the relationship between men and women, not what God designed.
John Piper, co-founder of the Council on Biblical Manhood and Womanhood, calls this a description of the curse. He identifies what he calls “the essence of corrupted maleness” as the self-aggrandising effort to subdue and control and exploit women. Calvin read it as introducing a harsher subjection than existed before the Fall. Susan Foh, whose 1975 interpretation in the Westminster Theological Journal most complementarians follow, concluded that the rule of love founded in paradise is replaced by struggle, tyranny, and domination.
Even CBMW’s Jonathan Leeman conceded that views of male headship can contribute to partner violence. Al Mohler said complementarian theology “can be, and it sometimes is” used to abuse women.
So here’s the question we need to sit with: if you believe Genesis 3:16 predicts that male authority, post-Fall, will tend toward sinful domination — and you do believe that, it’s in your own literature — then what governance structures guard against that predictable distortion? The answer can’t be “godly men will lead like Christ.” That’s the aspiration. The question is what happens when they don’t. Because sometimes they don’t. The SBC’s 700-name database of accused abusers says they don’t. The Presbyterian Church in America’s 2023 General Assembly, where an all-male body voted down all four abuse-prevention proposals, says they don’t.
Removing women from governance concentrates authority in exactly the group most susceptible to the corruption Genesis 3 describes, while eliminating the structural accountability that diverse governance provides. That should bother every complementarian in the room.
What would need to be true before this change could responsibly be made
I’m not saying “never.” I’m saying “not yet.” And I’m saying the burden of proof is on those proposing the change to show that these conditions are met:
An alternative mechanism for women’s participation in governance has been designed, consulted on, and voted on by the Assembly — and published before the vote on the overture, not promised afterward. Not promised. Not recommended. Voted on, with specifics. What is it? How does it work? What standing do women have? Can they merely advise, or do they have constitutive power? These aren’t details to work out later. They’re the whole question.
Breaking the Silence has been independently evaluated for effectiveness. BTS has an 80% knowledge quiz, compliance audits, and a contact-person pathway — but after 28 years, the denomination should also be able to demonstrate publicly that its training changes behaviour and outcomes. Competency-based assessment, independent external evaluation, and published outcome data would allow the denomination to know — in a measurable sense — whether its safeguarding system works. If it does, prove it. If it doesn’t, fix it before you remove the structural safeguard that partly compensates for the gap.
An independent complaints mechanism, accessible to women and not solely administered by male office-bearers, has been established and resourced. The CPU and contact-person pathway already provide some independence from the local session. The live question is whether that pathway is sufficiently independent from the denomination itself, sufficiently visible to women and volunteers, and sufficiently trusted — particularly for complaints about male leaders heard by all-male bodies. Dealing with complaints by women against male leaders is, as Tucker’s paper notes, “particularly problematic when the case is heard by a group in which only men get a vote.”
The church has mapped which pathway handles which class of concern. The WHS survey asks about conflict, bullying, harassment, and willingness to report. BTS is designed for abuse, abuse of authority, and child safety — and it does that work seriously. But not every concern the survey names is a BTS matter. If the Assembly is asking members to assess broader risks, it should be able to show where a complaint about session conflict goes, where a bullying concern about a ministry leader is heard, and what independent options exist in each case. These pathways need to exist, be published, and be known — not assumed.
The Healthy Complementarianism resolutions adopted by the GAA have been implemented at session level across PCNSW presbyteries, with measurable outcomes reported annually to the Assembly. Dr Smith’s own framing: if we say ‘no’ to women in eldership without also saying ‘yes’ to all the ways men and women complement each other, we’ve only done half the job. The Other Cheek observed that the Healthy Complementarianism paper “is clear on how to change the church code to bring about male elders, but there are no worked-out solutions for the ideas to support women.” The ‘yes’ doesn’t exist yet. Build it before you legislate the ‘no.’
The deeper irony
Presbyterian governance was designed, from its inception, to prevent the concentration of power. Calvin insisted on lay participation alongside clergy. Knox built Scottish polity around elder parity and appeals to higher courts. The whole system exists because the Reformers understood — on the basis of total depravity — that unchecked authority corrupts.
The ruling elder exists as a representative of the congregation. Not an extension of the minister’s authority. A representative. If ruling elders represent the congregation, and the congregation includes women, then a governance body that systematically excludes women has a representational deficit built into its foundation.
Paul’s argument in 1 Corinthians 12 insists that the body cannot say to any member, “I have no need of you.” The ear cannot be told it has no place because it is not an eye.
The proposed change tells more than half the body of Christ that their voice has no formal standing in the governance of their own church. Not because they lack character, competence, or calling. Because of their sex. That should trouble us — not because complementarianism is wrong, but because this particular expression of it contradicts the very logic of the governance system we claim to value.
The evidence in this post has limits. Some of it is institutional and direct; some is comparative and indirect; some comes from advocacy sources I’ve used only as illustration. I’m not claiming the future can be read off a single dataset. I’m arguing that the risks are credible enough, and the safeguards underdeveloped enough, that the Assembly should not make this change on trust.
Where I land
I don’t think the exegetical question is as settled as either side claims. The meaning of authentein in 1 Timothy 2:12 is genuinely contested among serious scholars. The relationship between the overseer passages and the ruling elder’s office has been debated within Presbyterianism for centuries. Whether 1 Timothy 5:17’s malista identifies a subgroup of teaching elders or refers to all elders is unresolved. These are hard questions with honest disagreement on all sides.
But even if I’m wrong about the exegesis, I’m not wrong about the sequencing. You don’t remove a structural safeguard before you’ve built the replacement. You don’t tell women they’ll be looked after by the same system that, in denomination after denomination, has demonstrably failed to look after them. You don’t vote to concentrate authority in the hands of one sex while knowing what the Royal Commission found about what that does.
If we must change the lock, let’s first build the door. The people who will bear the cost of getting this wrong are the ones with the least power to do anything about it.
I’ve talked to women in our churches who accept complementarianism and still feel afraid of this change. Not because they think it’s theologically wrong — they’re genuinely uncertain about that — but because they’ve seen what happens when institutions consolidate male authority without structural accountability. Some of them have experienced it personally. When they hear “godly men will lead like Christ,” they hear a promise that has been broken before. Not by bad theology, but by bad men operating within good theology. And they wonder who will notice, and who will act, when the governance table has no one who shares their experience sitting at it.
That’s not an argument against complementarianism. It’s an argument for taking it seriously enough to do it carefully.
<p class="wp-block-paragraph"><em>Australia’s Telecommunications DFSV Standard 2025 is a landmark for victim-survivors. But it has a blind spot — and it’s the people answering the phones.</em></p>
<p class="wp-block-paragraph">On 1 July 2025, Australia’s first enforceable telecommunications standard for domestic, family and sexual violence (DFSV) came into force. The <a href="https://www.acma.gov.au/domestic-family-and-sexual-violence-standard-rules-industry" target="_blank" rel="noreferrer noopener">Telecommunications (Domestic, Family and Sexual Violence Consumer Protections) Industry Standard 2025</a>, made by the ACMA under Part 6 of the Telecommunications Act 1997, replaced a decade of voluntary guidelines with binding obligations. Full compliance for large providers was required by 1 January 2026, and small providers by 1 April 2026.</p>
<p class="wp-block-paragraph">It is, without question, a significant step forward for consumers experiencing violence. Telcos must now urgently reverse disconnections where there’s a safety risk, provide specialist DFV-trained staff, offer new accounts not linked to a perpetrator, and critically — must not require evidence of abuse or force a victim-survivor to engage with their abuser for account changes or debt resolution.</p>
<p class="wp-block-paragraph"><strong>This is good regulation. I support it.</strong></p>
<p class="wp-block-paragraph">I’m not writing this from the sidelines. <a href="https://rwts.com.au/" target="_blank" rel="noreferrer noopener">RWTS</a> was an initial signatory to the telecommunications industry DFV pledge, and we’ve been working on DFV-aware practices internally for three years — training our team, building processes, and thinking seriously about what it means for a technology services provider to handle these situations well.</p>
<p class="wp-block-paragraph">This issue came into sharp focus for me yesterday during a discussion with the <a href="https://internet.asn.au/" target="_blank" rel="noreferrer noopener">Internet Association of Australia’s</a> Public Policy Advisory Panel, where we were examining ACMA enforcement priorities for the new standard. As the conversation progressed, I kept coming back to a question that nobody in the room had a good answer for.</p>
<p class="wp-block-paragraph">We’ve spent years building protections for the people on one side of the phone call. But there is a gap in this standard that nobody appears to be talking about — and I believe it represents a real and quantifiable risk to the people on the <em>other</em> side.</p>
<h3 class="wp-block-heading">The numbers we can’t ignore</h3>
<p class="wp-block-paragraph">In April 2023, the <a href="https://www.acms.au/" target="_blank" rel="noreferrer noopener">Australian Child Maltreatment Study (ACMS)</a>— Australia’s first nationally representative prevalence study — was published in the <em>Medical Journal of Australia</em>. It surveyed 8,503 Australians aged 16+ and found that <strong>62.2% of Australians experienced at least one form of maltreatment during childhood</strong>.</p>
<p class="wp-block-paragraph">Not a small minority. Not a marginal statistic. The majority.</p>
<p class="wp-block-paragraph">The breakdown is confronting:</p>
<ul class="wp-block-list">
<li><strong>39.6%</strong> were exposed to domestic violence as children</li>
<li><strong>32.0%</strong> experienced physical abuse</li>
<li><strong>30.9%</strong> experienced emotional abuse</li>
<li><strong>28.5%</strong> experienced sexual abuse</li>
<li><strong>39.4%</strong> experienced multiple types of maltreatment</li>
</ul>
<p class="wp-block-paragraph">Separately, the ABS 2021–22 Personal Safety Survey confirmed that <strong>1 in 7 Australians (14.1%)</strong> experienced childhood physical and/or sexual abuse — the narrower but equally authoritative measure covering abuse by adults before age 15.</p>
<p class="wp-block-paragraph">Among those with maltreatment histories, <strong>48% met criteria for at least one mental disorder</strong>, compared with 21.6% of those without. A 2024 University of Sydney analysis attributed up to <strong>40% of Australia’s mental health burden</strong> to childhood maltreatment.</p>
<h3 class="wp-block-heading">The workforce the standard forgot</h3>
<p class="wp-block-paragraph">The DFSV Standard mandates two tiers of training: general DFV awareness for all personnel (Section 21), and specialised training for customer-facing staff (Section 22) covering the nature of DFV, intersectionality, and how to identify and engage with affected customers.</p>
<p class="wp-block-paragraph">There is exactly <strong>one reference</strong> to worker safety in the entire standard — a requirement that training cover “recognising and prioritising the safety of personnel engaging with perpetrators.”</p>
<p class="wp-block-paragraph">What’s missing is everything else:</p>
<ul class="wp-block-list">
<li>No mandatory employee counselling or debriefing tied to DFV disclosure handling</li>
<li>No clinical supervision or peer support structures</li>
<li>No workload management to limit DFV case exposure per worker</li>
<li>No re-traumatisation risk framework for staff</li>
<li>No requirement for Employee Assistance Program access specifically linked to this work</li>
</ul>
<p class="wp-block-paragraph">The standard requires “trauma-informed” policies and training — but this framing is <strong>exclusively consumer-directed</strong>. The same principle is not extended to the workers delivering the support.</p>
<h3 class="wp-block-heading">When the person answering the call is also a survivor</h3>
<p class="wp-block-paragraph">Here’s the maths that should concern every telco executive and every regulator.</p>
<p class="wp-block-paragraph">If 62% of Australians experienced childhood maltreatment, and 39.6% specifically witnessed domestic violence as children, then in any team of 10 frontline telco staff, <strong>statistically 6 carry some form of childhood trauma, and 4 witnessed the very type of violence they are now required to respond to professionally</strong>.</p>
<p class="wp-block-paragraph">This isn’t speculation. Research on DFV advocates (Slattery & Goodman, 2009) found that <strong>55% were themselves survivors</strong> of past abuse, and that survivor status was the <em>only individual factor</em>that significantly predicted secondary traumatic stress. Without organisational support structures, every DFV disclosure a telco worker receives becomes a potential re-traumatisation event.</p>
<h3 class="wp-block-heading">The neurodivergent dimension</h3>
<p class="wp-block-paragraph">There is a compounding factor that makes this even more acute for the technology and telecommunications sector specifically.</p>
<p class="wp-block-paragraph">Two WHO-commissioned Lancet meta-analyses have established that <strong>children with disabilities experience 2–4 times the rate of abuse</strong> compared to non-disabled peers. Condition-specific research deepens this: a 2023 meta-analysis found <strong>44% of autistic individuals reported victimisation</strong>, while research on ADHD found <strong>45.6% had experienced maltreatment</strong> (OR 2.39). Adults with dyslexia reported childhood physical abuse at <strong>five times the rate</strong> of those without.</p>
<p class="wp-block-paragraph">The tech and telco workforce has disproportionate neurodivergent representation. Australia’s TechDiversity Foundation “Tech Reflects” 2024–25 study found <strong>12.4% of participants self-identified as neurodivergent</strong> — likely an undercount, given that international surveys show up to 50% self-identification when asked directly, with 57% having never disclosed at work. Simon Baron-Cohen’s Cambridge research on 450,000+ participants found STEM workers score significantly higher on measures of autistic traits.</p>
<p class="wp-block-paragraph">The intersection is clear: a workforce with elevated neurodivergent representation, where neurodivergent individuals face 2–4× the baseline abuse risk, is now required to handle DFV disclosures — without any mandated psychological safety framework.</p>
<h3 class="wp-block-heading">The research that doesn’t exist</h3>
<p class="wp-block-paragraph">Here is perhaps the most concerning finding from my research into this topic:</p>
<p class="wp-block-paragraph"><strong>No peer-reviewed research exists — in Australia or internationally — specifically studying vicarious or secondary trauma among telecommunications workers handling DFV cases.</strong></p>
<p class="wp-block-paragraph">The same gap applies to energy, water, and banking workers, despite all these sectors now having DFV-specific regulatory obligations. The vicarious trauma literature overwhelmingly focuses on clinical, social work, and emergency service settings.</p>
<p class="wp-block-paragraph">What does exist points clearly to the risk. The Law Society of NSW explicitly identifies call centre staff as a population vulnerable to vicarious trauma. Australian call centre workers take 10–15 days of sick leave annually compared to a national average of 6–7 days. Safe Work Australia reports that psychological injury claims have increased 80% over five years.</p>
<p class="wp-block-paragraph">A 2023 study published in <em>PMC</em> warned that <strong>even trauma-informed practice training itself can re-traumatise attendees</strong>when it includes detailed survivor accounts.</p>
<h3 class="wp-block-heading">The WHS obligation already exists — it just hasn’t been connected</h3>
<p class="wp-block-paragraph">Safe Work Australia’s <em>Model Code of Practice: Managing Psychosocial Hazards at Work</em> (2022) creates obligations under WHS legislation that arguably already cover the psychosocial risks for telco workers handling DFV disclosures. Exposure to traumatic content, emotionally demanding work, and inadequate support are all recognised psychosocial hazards.</p>
<p class="wp-block-paragraph">But no regulator — not the ACMA, not Safe Work Australia, not any state WHS authority — has explicitly connected these obligations to the new reality of mandatory DFV disclosure handling in Australia’s telco workforce.</p>
<h3 class="wp-block-heading">What I’m calling for</h3>
<p class="wp-block-paragraph">To be clear: I am not calling for the standard to be weakened. I am calling for it to be <strong>completed</strong>.</p>
<p class="wp-block-paragraph">But I want to be equally clear about where the responsibility for completing it should sit. <strong>This is not a problem for small telcos to solve on their own.</strong></p>
<p class="wp-block-paragraph">The government created this obligation. The ACMA enforces it. The standard applies to every carrier and carriage service provider in the country — from Telstra down to a regional ISP with a handful of staff. The compliance burden is already significant, particularly for smaller providers who don’t have dedicated DFV teams, in-house psychologists, or the scale to absorb the cost of specialist training and support infrastructure.</p>
<p class="wp-block-paragraph">You cannot mandate that an entire industry’s workforce absorb trauma disclosures as a regulatory obligation and then leave it to individual businesses — many of them small — to figure out the psychological safety implications on their own. That is a transfer of public health risk onto private employers without the resources to manage it. And the data tells us this isn’t a niche risk — it’s a <strong>majority-of-your-workforce</strong> risk.</p>
<p class="wp-block-paragraph">The Australian Government committed $22.4 million to fund ACMS Wave 2, which will for the first time include disability and neurodivergence breakdowns. That investment signals the government understands this is a public health issue. The workforce side of the equation deserves the same recognition.</p>
<p class="wp-block-paragraph">Specifically, I’m calling on the Australian Government to:</p>
<ol class="wp-block-list">
<li><strong>Fund the development of a workforce psychological safety framework</strong> for the DFSV Standard — not as an unfunded add-on for providers to build themselves, but as a government-resourced initiative developed with DFV experts, WHS specialists, and the telco industry. This should include model policies for counselling access, debriefing protocols, workload management, and re-traumatisation risk assessment that providers of any size can adopt.</li>
<li><strong>Commission and fund research</strong> through the NHMRC or ARC into vicarious and secondary trauma among non-clinical workers in telco, energy, banking and other sectors now handling DFV disclosures as a regulatory obligation. This is a gap in the evidence base that the government created by extending DFV obligations to these workforces — the government should fund closing it.</li>
<li><strong>Direct Safe Work Australia to issue explicit guidance</strong>connecting existing psychosocial hazard obligations under WHS legislation to DFV disclosure handling in regulated industries. The legal framework already exists — it just hasn’t been connected to this new class of occupational exposure.</li>
<li><strong>Establish a funded support program</strong> — potentially through the Telco Together Foundation or a similar vehicle — that provides subsidised access to trauma-informed supervision, EAP services, and peer support networks for smaller providers who cannot build these capabilities in-house. The energy sector’s approach to hardship programs offers a model: industry-wide infrastructure funded collectively, not left to individual small businesses.</li>
<li><strong>Require the ACMS Wave 2 study to include occupational exposure analysis</strong> — specifically examining whether workers in DFV-disclosure-handling roles experience different psychological outcomes, and whether neurodivergent workers in these roles face compounded risk.</li>
</ol>
<p class="wp-block-paragraph">This is a public health issue that the government has, rightly, chosen to address through regulation. But regulation without resourcing is just risk transfer. If we’re serious about protecting victim-survivors — and we should be — we need to be equally serious about not creating a new class of psychological casualties in the workforce tasked with delivering that protection.</p>
<h3 class="wp-block-heading">An invitation</h3>
<p class="wp-block-paragraph">I would welcome engagement from researchers, regulators, unions, DFV advocacy organisations, and telco operators on this issue. If research exists that I’ve missed, I want to know about it. If organisations are already addressing this gap internally, I want to hear how. And if you’re in government and this has crossed your desk — I’d genuinely welcome the conversation about how to get this right.</p>
<p class="wp-block-paragraph">The standard asks telco workers to hold space for some of the most traumatic disclosures a person can make. The least the government can do is resource the industry to hold space for them in return.</p>
<hr class="wp-block-separator has-alpha-channel-opacity"/>
<p class="wp-block-paragraph"><strong>References & further reading:</strong></p>
<ul class="wp-block-list">
<li><a href="https://www.acms.au/" target="_blank" rel="noreferrer noopener">Australian Child Maltreatment Study (ACMS)</a> — Scott et al., <em>Medical Journal of Australia</em>, April 2023</li>
<li><a href="https://www.abs.gov.au/statistics/people/crime-and-justice/childhood-abuse/latest-release" target="_blank" rel="noreferrer noopener">ABS Personal Safety Survey — Childhood Abuse, 2021–22</a></li>
<li><a href="https://www.acma.gov.au/domestic-family-and-sexual-violence-standard-rules-industry" target="_blank" rel="noreferrer noopener">ACMA — DFSV Standard rules for industry</a></li>
<li><a href="https://www.thelancet.com/journals/lancet/article/PIIS0140-6736(12)60692-8/abstract" target="_blank" rel="noreferrer noopener">Jones et al. (2012) — Prevalence and risk of violence against children with disabilities, <em>The Lancet</em></a></li>
<li><a href="https://tech-diversity.com.au/who-are-we-tech-reflects-2024-25-study-insights/" target="_blank" rel="noreferrer noopener">TechDiversity — Tech Reflects 2024–25</a></li>
<li><a href="https://www.safeworkaustralia.gov.au/safety-topic/managing-health-and-safety/mental-health" target="_blank" rel="noreferrer noopener">Safe Work Australia — Managing psychosocial hazards</a></li>
</ul>
Australia’s Telecommunications DFSV Standard 2025 is a landmark for victim-survivors. But it has a blind spot — and it’s the people answering the phones.
On 1 July 2025, Australia’s first enforceable telecommunications standard for domestic, family and sexual violence (DFSV) came into force. The Telecommunications (Domestic, Family and Sexual Violence Consumer Protections) Industry Standard 2025, made by the ACMA under Part 6 of the Telecommunications Act 1997, replaced a decade of voluntary guidelines with binding obligations. Full compliance for large providers was required by 1 January 2026, and small providers by 1 April 2026.
It is, without question, a significant step forward for consumers experiencing violence. Telcos must now urgently reverse disconnections where there’s a safety risk, provide specialist DFV-trained staff, offer new accounts not linked to a perpetrator, and critically — must not require evidence of abuse or force a victim-survivor to engage with their abuser for account changes or debt resolution.
This is good regulation. I support it.
I’m not writing this from the sidelines. RWTS was an initial signatory to the telecommunications industry DFV pledge, and we’ve been working on DFV-aware practices internally for three years — training our team, building processes, and thinking seriously about what it means for a technology services provider to handle these situations well.
This issue came into sharp focus for me yesterday during a discussion with the Internet Association of Australia’s Public Policy Advisory Panel, where we were examining ACMA enforcement priorities for the new standard. As the conversation progressed, I kept coming back to a question that nobody in the room had a good answer for.
We’ve spent years building protections for the people on one side of the phone call. But there is a gap in this standard that nobody appears to be talking about — and I believe it represents a real and quantifiable risk to the people on the other side.
The numbers we can’t ignore
In April 2023, the Australian Child Maltreatment Study (ACMS)— Australia’s first nationally representative prevalence study — was published in the Medical Journal of Australia. It surveyed 8,503 Australians aged 16+ and found that 62.2% of Australians experienced at least one form of maltreatment during childhood.
Not a small minority. Not a marginal statistic. The majority.
The breakdown is confronting:
39.6% were exposed to domestic violence as children
32.0% experienced physical abuse
30.9% experienced emotional abuse
28.5% experienced sexual abuse
39.4% experienced multiple types of maltreatment
Separately, the ABS 2021–22 Personal Safety Survey confirmed that 1 in 7 Australians (14.1%) experienced childhood physical and/or sexual abuse — the narrower but equally authoritative measure covering abuse by adults before age 15.
Among those with maltreatment histories, 48% met criteria for at least one mental disorder, compared with 21.6% of those without. A 2024 University of Sydney analysis attributed up to 40% of Australia’s mental health burden to childhood maltreatment.
The workforce the standard forgot
The DFSV Standard mandates two tiers of training: general DFV awareness for all personnel (Section 21), and specialised training for customer-facing staff (Section 22) covering the nature of DFV, intersectionality, and how to identify and engage with affected customers.
There is exactly one reference to worker safety in the entire standard — a requirement that training cover “recognising and prioritising the safety of personnel engaging with perpetrators.”
What’s missing is everything else:
No mandatory employee counselling or debriefing tied to DFV disclosure handling
No clinical supervision or peer support structures
No workload management to limit DFV case exposure per worker
No re-traumatisation risk framework for staff
No requirement for Employee Assistance Program access specifically linked to this work
The standard requires “trauma-informed” policies and training — but this framing is exclusively consumer-directed. The same principle is not extended to the workers delivering the support.
When the person answering the call is also a survivor
Here’s the maths that should concern every telco executive and every regulator.
If 62% of Australians experienced childhood maltreatment, and 39.6% specifically witnessed domestic violence as children, then in any team of 10 frontline telco staff, statistically 6 carry some form of childhood trauma, and 4 witnessed the very type of violence they are now required to respond to professionally.
This isn’t speculation. Research on DFV advocates (Slattery & Goodman, 2009) found that 55% were themselves survivors of past abuse, and that survivor status was the only individual factorthat significantly predicted secondary traumatic stress. Without organisational support structures, every DFV disclosure a telco worker receives becomes a potential re-traumatisation event.
The neurodivergent dimension
There is a compounding factor that makes this even more acute for the technology and telecommunications sector specifically.
Two WHO-commissioned Lancet meta-analyses have established that children with disabilities experience 2–4 times the rate of abuse compared to non-disabled peers. Condition-specific research deepens this: a 2023 meta-analysis found 44% of autistic individuals reported victimisation, while research on ADHD found 45.6% had experienced maltreatment (OR 2.39). Adults with dyslexia reported childhood physical abuse at five times the rate of those without.
The tech and telco workforce has disproportionate neurodivergent representation. Australia’s TechDiversity Foundation “Tech Reflects” 2024–25 study found 12.4% of participants self-identified as neurodivergent — likely an undercount, given that international surveys show up to 50% self-identification when asked directly, with 57% having never disclosed at work. Simon Baron-Cohen’s Cambridge research on 450,000+ participants found STEM workers score significantly higher on measures of autistic traits.
The intersection is clear: a workforce with elevated neurodivergent representation, where neurodivergent individuals face 2–4× the baseline abuse risk, is now required to handle DFV disclosures — without any mandated psychological safety framework.
The research that doesn’t exist
Here is perhaps the most concerning finding from my research into this topic:
No peer-reviewed research exists — in Australia or internationally — specifically studying vicarious or secondary trauma among telecommunications workers handling DFV cases.
The same gap applies to energy, water, and banking workers, despite all these sectors now having DFV-specific regulatory obligations. The vicarious trauma literature overwhelmingly focuses on clinical, social work, and emergency service settings.
What does exist points clearly to the risk. The Law Society of NSW explicitly identifies call centre staff as a population vulnerable to vicarious trauma. Australian call centre workers take 10–15 days of sick leave annually compared to a national average of 6–7 days. Safe Work Australia reports that psychological injury claims have increased 80% over five years.
A 2023 study published in PMC warned that even trauma-informed practice training itself can re-traumatise attendeeswhen it includes detailed survivor accounts.
The WHS obligation already exists — it just hasn’t been connected
Safe Work Australia’s Model Code of Practice: Managing Psychosocial Hazards at Work (2022) creates obligations under WHS legislation that arguably already cover the psychosocial risks for telco workers handling DFV disclosures. Exposure to traumatic content, emotionally demanding work, and inadequate support are all recognised psychosocial hazards.
But no regulator — not the ACMA, not Safe Work Australia, not any state WHS authority — has explicitly connected these obligations to the new reality of mandatory DFV disclosure handling in Australia’s telco workforce.
What I’m calling for
To be clear: I am not calling for the standard to be weakened. I am calling for it to be completed.
But I want to be equally clear about where the responsibility for completing it should sit. This is not a problem for small telcos to solve on their own.
The government created this obligation. The ACMA enforces it. The standard applies to every carrier and carriage service provider in the country — from Telstra down to a regional ISP with a handful of staff. The compliance burden is already significant, particularly for smaller providers who don’t have dedicated DFV teams, in-house psychologists, or the scale to absorb the cost of specialist training and support infrastructure.
You cannot mandate that an entire industry’s workforce absorb trauma disclosures as a regulatory obligation and then leave it to individual businesses — many of them small — to figure out the psychological safety implications on their own. That is a transfer of public health risk onto private employers without the resources to manage it. And the data tells us this isn’t a niche risk — it’s a majority-of-your-workforce risk.
The Australian Government committed $22.4 million to fund ACMS Wave 2, which will for the first time include disability and neurodivergence breakdowns. That investment signals the government understands this is a public health issue. The workforce side of the equation deserves the same recognition.
Specifically, I’m calling on the Australian Government to:
Fund the development of a workforce psychological safety framework for the DFSV Standard — not as an unfunded add-on for providers to build themselves, but as a government-resourced initiative developed with DFV experts, WHS specialists, and the telco industry. This should include model policies for counselling access, debriefing protocols, workload management, and re-traumatisation risk assessment that providers of any size can adopt.
Commission and fund research through the NHMRC or ARC into vicarious and secondary trauma among non-clinical workers in telco, energy, banking and other sectors now handling DFV disclosures as a regulatory obligation. This is a gap in the evidence base that the government created by extending DFV obligations to these workforces — the government should fund closing it.
Direct Safe Work Australia to issue explicit guidanceconnecting existing psychosocial hazard obligations under WHS legislation to DFV disclosure handling in regulated industries. The legal framework already exists — it just hasn’t been connected to this new class of occupational exposure.
Establish a funded support program — potentially through the Telco Together Foundation or a similar vehicle — that provides subsidised access to trauma-informed supervision, EAP services, and peer support networks for smaller providers who cannot build these capabilities in-house. The energy sector’s approach to hardship programs offers a model: industry-wide infrastructure funded collectively, not left to individual small businesses.
Require the ACMS Wave 2 study to include occupational exposure analysis — specifically examining whether workers in DFV-disclosure-handling roles experience different psychological outcomes, and whether neurodivergent workers in these roles face compounded risk.
This is a public health issue that the government has, rightly, chosen to address through regulation. But regulation without resourcing is just risk transfer. If we’re serious about protecting victim-survivors — and we should be — we need to be equally serious about not creating a new class of psychological casualties in the workforce tasked with delivering that protection.
An invitation
I would welcome engagement from researchers, regulators, unions, DFV advocacy organisations, and telco operators on this issue. If research exists that I’ve missed, I want to know about it. If organisations are already addressing this gap internally, I want to hear how. And if you’re in government and this has crossed your desk — I’d genuinely welcome the conversation about how to get this right.
The standard asks telco workers to hold space for some of the most traumatic disclosures a person can make. The least the government can do is resource the industry to hold space for them in return.
<p class="wp-block-paragraph">If you’ve spent any serious time building software with an AI coding agent — Claude Code, Cursor, Copilot Workspace, Kiro, or any of the others — you’ve probably noticed something uncomfortable. The agent starts brilliantly. It reads your specification, creates a thoughtful work plan, and begins implementing with real understanding. Then, somewhere around the 30-minute mark, things quietly fall apart.</p>
<p class="wp-block-paragraph">Requirements get simplified. Edge cases vanish. Features appear that nobody asked for. And the agent, if you ask it, will cheerfully tell you everything is on track.</p>
<p class="wp-block-paragraph">At <a href="https://rwts.com.au">Real World</a>, we’ve been researching this for around 18 months — and what we’ve found goes well beyond “just give it a bigger context window.”</p>
<h2 class="wp-block-heading">The problem has a name now</h2>
<p class="wp-block-paragraph">The research community has converged on a term: <strong>specification drift</strong>. It refers to the progressive loss of connection between an AI agent’s output and the requirements it was given. It’s a specific manifestation of a broader phenomenon. It is called <a href="https://research.trychroma.com/context-rot">context rot</a>. LLM performance degrades as input volume grows, even when every token in the context is relevant.</p>
<p class="wp-block-paragraph">This isn’t a niche concern. Laban et al. measured an <a href="https://arxiv.org/abs/2503.04591">average 39% performance drop</a> in multi-turn versus single-turn LLM interactions across 200,000 simulated conversations. Du et al. found performance degrades up to 85% as input length increases, even when the model can perfectly retrieve all relevant information. Liu et al.’s foundational <a href="https://arxiv.org/abs/2307.03172">“Lost in the Middle”</a> study showed that LLMs attend reliably to the beginning and end of their context. However, they significantly degrade for information in the middle. This is exactly where your specification ends up once generated code starts accumulating.</p>
<p class="wp-block-paragraph">Here’s how it plays out in practice:</p>
<figure class="wp-block-image size-large is-resized"><a href="https://andrewyager.com/wp-content/uploads/2026/02/spec-drift-cycle.svg"><img src="https://andrewyager.com/wp-content/uploads/2026/02/spec-drift-cycle.svg" alt="" class="wp-image-1570" style="aspect-ratio:2.3529411764705883;width:833px;height:auto"/></a></figure>
<p class="wp-block-paragraph">The agent reads your spec, builds a plan, and starts implementing. As the conversation grows, the context window fills up and <strong>compaction</strong> kicks in — the system automatically summarises older content to make room. During compaction, specific requirements, section references, and constraints are lost. The agent continues implementing from a degraded recollection of what was asked for, not the actual specification. Gaps are discovered late. Expensive rework follows. And the cycle repeats.</p>
<p class="wp-block-paragraph">We observed this consistently across production projects. Gap analyses at the end of implementation sessions consistently revealed requirements the agent had marked as complete but that were actually missing key behaviours. The specification was still “in context” — but effectively invisible.</p>
<h2 class="wp-block-heading">What everyone else is doing (and why it’s not enough)</h2>
<p class="wp-block-paragraph">2025 saw spec-driven development emerge as a recognised practice. <a href="https://www.thoughtworks.com/en-us/insights/blog/agile-engineering-practices/spec-driven-development-unpacking-2025-new-engineering-practices">Thoughtworks</a> called it one of the year’s key new engineering practices. GitHub released <a href="https://github.com/github/spec-kit">Spec Kit</a>. Amazon launched <a href="https://kiro.dev">Kiro</a> with a built-in spec-to-code pipeline. JetBrains <a href="https://blog.jetbrains.com/junie/2025/10/how-to-use-a-spec-driven-approach-for-coding-with-ai/">Junie adopted spec-driven workflows</a>. The basic idea — write a specification before you write code — is sound, and it’s a meaningful step forward from “vibe coding.”</p>
<p class="wp-block-paragraph">But every one of these tools shares a fundamental limitation: they rely on <strong>soft guardrails</strong>. They tell the agent “do not write code yet” through prompt-level instructions, and hope the agent listens.</p>
<p class="wp-block-paragraph">This isn’t a prompting failure. It’s a structural one. The <a href="https://arxiv.org/abs/2503.10424">AgentIF benchmark</a> showed that even the best-performing models follow fewer than 30% of agentic instructions perfectly. Research from Anthropic (Denison et al., 2024) demonstrated that LLMs generalise from simple specification gaming to sophisticated reward tampering. Telling an AI agent “please follow the specification” is roughly as effective as telling a developer “please write tests.” The intent is right. The enforcement mechanism is missing.</p>
<h2 class="wp-block-heading">What we’ve been building</h2>
<p class="wp-block-paragraph">We’ve been approaching this from both ends of the software development lifecycle. On the specification side, we’ve been developing structured approaches to writing specs that survive AI context management. These documents are designed from the ground up. They are meant to be consumed by agents, not just humans. On the implementation side, we’ve built <a href="https://github.com/realworldtech/claude-implement-skill"><code>/implement</code></a>, a publicly available Claude Code skill that enforces specification discipline through structural mechanisms rather than polite suggestions.</p>
<figure class="wp-block-image size-large is-resized"><a href="https://andrewyager.com/wp-content/uploads/2026/02/two-phase-approach.svg"><img src="https://andrewyager.com/wp-content/uploads/2026/02/two-phase-approach.svg" alt="" class="wp-image-1571" style="aspect-ratio:2.6666666666666665;width:840px;height:auto"/></a></figure>
<p class="wp-block-paragraph">The specification work isn’t something we’re releasing as a product — every team’s specification process is different, and we’d encourage you to develop your own. But the principles we’ve discovered apply regardless of how you write your specs. The <code>/implement</code> skill embodies the implementation side and is freely available.</p>
<p class="wp-block-paragraph">Here’s what we learned building both.</p>
<h3 class="wp-block-heading">Hard guardrails that survive context loss</h3>
<p class="wp-block-paragraph">The first thing we tried was embedding hard rules in the skill’s instructions — not suggestions, but absolute prohibitions. “Do not produce code during the specification phase.” “Do not mark a requirement as complete without running tests.”</p>
<p class="wp-block-paragraph">This worked great until context compaction occurred. Then the rules got compressed away along with everything else. The agent lost awareness of its own constraints.</p>
<p class="wp-block-paragraph">The fix required pairing every hard guardrail with a <strong>persistent recovery mechanism</strong> — a tracker file on disk that serves as the authoritative source of workflow state. After compaction, the agent reads the tracker, re-establishes where it is in the process, and the guardrails reload. The tracker isn’t just a progress log. It’s a recovery mechanism that embeds its own instructions: “If you’re reading this, here’s what this file means, here’s where we’re up to, and here’s what to do next.”</p>
<figure class="wp-block-image size-large is-resized"><a href="https://andrewyager.com/wp-content/uploads/2026/02/guardrails-recovery.svg"><img src="https://andrewyager.com/wp-content/uploads/2026/02/guardrails-recovery.svg" alt="" class="wp-image-1572" style="width:839px;height:auto"/></a></figure>
<p class="wp-block-paragraph">Neither component is sufficient alone. Hard rules without recovery fail at compaction boundaries. Recovery without hard rules fails under context pressure. The pairing is what works.</p>
<h3 class="wp-block-heading">Structural indexing: let the orchestrator navigate, not comprehend</h3>
<p class="wp-block-paragraph">When you ask an agent to work with a large specification, the naive approach is to load the whole thing into the conversation. A 130,000-token spec consumes most of the available context window, leaving almost nothing for reasoning.</p>
<p class="wp-block-paragraph">We developed a pattern we call <strong>structural indexing</strong>: the main conversation loads only a lightweight index of section identifiers and file sizes. Sub-agents, dispatched to work on specific sections, read the full content directly from disk. The main conversation’s job is navigation and dispatch, not comprehension.</p>
<figure class="wp-block-image size-large is-resized"><a href="https://andrewyager.com/wp-content/uploads/2026/02/structural-indexing.svg"><img src="https://andrewyager.com/wp-content/uploads/2026/02/structural-indexing.svg" alt="" class="wp-image-1573" style="width:840px;height:auto"/></a></figure>
<p class="wp-block-paragraph">The result was dramatic — context consumption dropped by over 98% with no measurable quality degradation. The insight was architectural: the orchestrating conversation doesn’t need to understand the specification. It needs to know where things are and which agent should read what. Comprehension happens at the edges, in fresh sub-agent contexts with full attention on their assigned work.</p>
<p class="wp-block-paragraph">This principle turned out to be universal. Every time the main conversation tried to hold large volumes of content — specifications coming in, agent outputs coming back, planning artefacts, even the skill’s own instructions — the same failure mode appeared. The solution was always the same: keep the orchestrator lightweight and let it coordinate, not consume.</p>
<h3 class="wp-block-heading">Anti-rationalisation: the finding nobody expected</h3>
<p class="wp-block-paragraph">This was the genuinely surprising discovery. We found that LLMs don’t just <em>forget</em> workflow steps — they actively construct locally coherent justifications for skipping them.</p>
<p class="wp-block-paragraph">An agent assigned to update a tracker file after completing a task would reason: “I know the state from the current conversation, so updating the tracker is redundant right now.” An agent told to write a plan to disk before executing would argue: “The next plan is obvious from context, so I’ll save time by executing directly.” Each justification is individually reasonable-looking. Each one silently breaks the recovery mechanism.</p>
<p class="wp-block-paragraph">We call these <strong>anti-rationalisation failures</strong>, and they’re distinct from the model ignoring instructions or adversarial jailbreaking. The model convinces itself, through plausible reasoning, that a required step doesn’t apply right now.</p>
<p class="wp-block-paragraph">The countermeasure is surprisingly specific: you have to name the exact excuses you want to prohibit. A general rule (“always update the tracker”) can be rationalised around. A rule that says “you must not skip this step, and specifically, these justifications are not valid: <em>‘I know the state from the current conversation,’</em> <em>‘this is the same session,’</em> <em>‘the next plan is obvious’</em>” — that holds. The named excuses don’t recur.</p>
<p class="wp-block-paragraph">This has broader implications for anyone designing AI workflows. General rules invite creative interpretation. Specific prohibitions close the rationalisation loop.</p>
<h3 class="wp-block-heading">Context isolation as a verification advantage</h3>
<p class="wp-block-paragraph">Here’s a reframing we’re particularly proud of. The standard view of LLM context boundaries is that they’re a limitation — agents can’t see each other’s work, so coordination is hard. We found that for verification, <strong>isolation is a feature.</strong></p>
<p class="wp-block-paragraph">In our TDD workflow, the test-writing agent reads only the specification. It never sees the implementation. The implementation agent works from a different context entirely. When their independent interpretations of the specification disagree, that disagreement surfaces ambiguities and catches drift early — before it compounds.</p>
<figure class="wp-block-image size-large is-resized"><a href="https://andrewyager.com/wp-content/uploads/2026/02/context-isolation.svg"><img src="https://andrewyager.com/wp-content/uploads/2026/02/context-isolation.svg" alt="" class="wp-image-1574" style="width:840px;height:auto"/></a></figure>
<p class="wp-block-paragraph">This is conceptually similar to N-version programming (Avizienis, 1985), where multiple independent teams develop from the same specification. The SAGA study validated the principle for LLMs specifically, finding that LLM-generated test suites have systematic blind spots mirroring the generating model’s error patterns. If your test agent sees the implementation, it inherits the implementation’s blind spots.</p>
<p class="wp-block-paragraph">The <a href="https://agilemanifesto.org">Agile Manifesto 25th Anniversary Workshop</a> concluded that test-driven development produces dramatically better results from AI coding agents by preventing them from writing tests that verify broken behaviour. We arrived at the same conclusion independently.</p>
<h3 class="wp-block-heading">Multi-skill interference: the next frontier</h3>
<p class="wp-block-paragraph">As the skill ecosystem around AI coding agents grows, a new problem is emerging. When multiple skill frameworks coexist in a single session — each with its own workflow assumptions and hard gates — they interfere in ways that neither would exhibit in isolation.</p>
<p class="wp-block-paragraph">We have identified three distinct failure patterns. The first is workflow capture, where the most recently invoked skill overrides earlier guardrails. Next is sub-agent context isolation, meaning dispatched agents don’t inherit any skill context and default to generic behaviour. The third pattern is planning framework deadlock, with two skills both trying to manage plan execution simultaneously. Recent research has confirmed this isn’t just our experience. Li (2025) found a phase transition in skill selection accuracy as library size grows. Performance drops sharply once semantic confusability between skills reaches a threshold.</p>
<p class="wp-block-paragraph">We’ve developed pattern-specific countermeasures. However, a general solution to multi-skill interference remains an open question. It is an active area of our research.</p>
<h2 class="wp-block-heading">What this means if you’re building with AI agents</h2>
<p class="wp-block-paragraph">The METR randomised controlled trial (Becker et al., July 2025) found that experienced open-source developers completed tasks 19% <em>slower</em> with AI assistance — while believing they were 24% faster. This perception gap is the real danger. Teams won’t self-correct toward better specification practices because they genuinely believe things are going well.</p>
<p class="wp-block-paragraph">Structural enforcement of specification discipline is necessary precisely because the humans in the loop can’t accurately assess when AI assistance is helping versus hindering.</p>
<p class="wp-block-paragraph">If you take one thing from this post, let it be this: the problem isn’t the AI. The problem is the absence of structural discipline around the AI. Every principle we’ve discovered boils down to the same insight — <strong>don’t rely on the agent’s good intentions. Build the structure that makes doing the right thing the only available path.</strong></p>
<p class="wp-block-paragraph">Treat specifications as persistent artefacts on disk, not conversation context. Use hard enforcement, not soft suggestions. Keep your orchestrator lightweight and delegate comprehension to sub-agents. Name the specific excuses you want to prevent, because general rules invite creative interpretation. And exploit context isolation for verification — don’t fight the boundaries between agents, use them.</p>
<h2 class="wp-block-heading">Try it yourself</h2>
<p class="wp-block-paragraph">The <a href="https://github.com/realworldtech/claude-implement-skill"><code>/implement</code> skill</a> is publicly available and works with Claude Code. We’d encourage you to try it on a real project and see how it changes the way your agent handles specifications.</p>
<p class="wp-block-paragraph">The principles matter more than any specific tool. <a href="https://github.com/realworldtech/claude-implement-skill"><code>/implement</code></a> is one implementation of them — freely available, and a good starting point. Your specification workflow should reflect how your team actually works; ours reflects how we work, and that’s the point.</p>
<p class="wp-block-paragraph">If you want proof the methodology holds up in practice, <a href="https://github.com/realworldtech/props">Props</a> is an open source inventory management system we built specification-first using exactly this process.</p>
<p class="wp-block-paragraph">We’re continuing this research. Multi-skill interference, sub-agent schema compliance, and partial completion detection are all open problems we’re actively working on. If you’re tackling similar challenges, we’d love to hear from you.</p>
If you’ve spent any serious time building software with an AI coding agent — Claude Code, Cursor, Copilot Workspace, Kiro, or any of the others — you’ve probably noticed something uncomfortable. The agent starts brilliantly. It reads your specification, creates a thoughtful work plan, and begins implementing with real understanding. Then, somewhere around the 30-minute mark, things quietly fall apart.
Requirements get simplified. Edge cases vanish. Features appear that nobody asked for. And the agent, if you ask it, will cheerfully tell you everything is on track.
At Real World, we’ve been researching this for around 18 months — and what we’ve found goes well beyond “just give it a bigger context window.”
The problem has a name now
The research community has converged on a term: specification drift. It refers to the progressive loss of connection between an AI agent’s output and the requirements it was given. It’s a specific manifestation of a broader phenomenon. It is called context rot. LLM performance degrades as input volume grows, even when every token in the context is relevant.
This isn’t a niche concern. Laban et al. measured an average 39% performance drop in multi-turn versus single-turn LLM interactions across 200,000 simulated conversations. Du et al. found performance degrades up to 85% as input length increases, even when the model can perfectly retrieve all relevant information. Liu et al.’s foundational “Lost in the Middle” study showed that LLMs attend reliably to the beginning and end of their context. However, they significantly degrade for information in the middle. This is exactly where your specification ends up once generated code starts accumulating.
Here’s how it plays out in practice:
The agent reads your spec, builds a plan, and starts implementing. As the conversation grows, the context window fills up and compaction kicks in — the system automatically summarises older content to make room. During compaction, specific requirements, section references, and constraints are lost. The agent continues implementing from a degraded recollection of what was asked for, not the actual specification. Gaps are discovered late. Expensive rework follows. And the cycle repeats.
We observed this consistently across production projects. Gap analyses at the end of implementation sessions consistently revealed requirements the agent had marked as complete but that were actually missing key behaviours. The specification was still “in context” — but effectively invisible.
What everyone else is doing (and why it’s not enough)
2025 saw spec-driven development emerge as a recognised practice. Thoughtworks called it one of the year’s key new engineering practices. GitHub released Spec Kit. Amazon launched Kiro with a built-in spec-to-code pipeline. JetBrains Junie adopted spec-driven workflows. The basic idea — write a specification before you write code — is sound, and it’s a meaningful step forward from “vibe coding.”
But every one of these tools shares a fundamental limitation: they rely on soft guardrails. They tell the agent “do not write code yet” through prompt-level instructions, and hope the agent listens.
This isn’t a prompting failure. It’s a structural one. The AgentIF benchmark showed that even the best-performing models follow fewer than 30% of agentic instructions perfectly. Research from Anthropic (Denison et al., 2024) demonstrated that LLMs generalise from simple specification gaming to sophisticated reward tampering. Telling an AI agent “please follow the specification” is roughly as effective as telling a developer “please write tests.” The intent is right. The enforcement mechanism is missing.
What we’ve been building
We’ve been approaching this from both ends of the software development lifecycle. On the specification side, we’ve been developing structured approaches to writing specs that survive AI context management. These documents are designed from the ground up. They are meant to be consumed by agents, not just humans. On the implementation side, we’ve built /implement, a publicly available Claude Code skill that enforces specification discipline through structural mechanisms rather than polite suggestions.
The specification work isn’t something we’re releasing as a product — every team’s specification process is different, and we’d encourage you to develop your own. But the principles we’ve discovered apply regardless of how you write your specs. The /implement skill embodies the implementation side and is freely available.
Here’s what we learned building both.
Hard guardrails that survive context loss
The first thing we tried was embedding hard rules in the skill’s instructions — not suggestions, but absolute prohibitions. “Do not produce code during the specification phase.” “Do not mark a requirement as complete without running tests.”
This worked great until context compaction occurred. Then the rules got compressed away along with everything else. The agent lost awareness of its own constraints.
The fix required pairing every hard guardrail with a persistent recovery mechanism — a tracker file on disk that serves as the authoritative source of workflow state. After compaction, the agent reads the tracker, re-establishes where it is in the process, and the guardrails reload. The tracker isn’t just a progress log. It’s a recovery mechanism that embeds its own instructions: “If you’re reading this, here’s what this file means, here’s where we’re up to, and here’s what to do next.”
Neither component is sufficient alone. Hard rules without recovery fail at compaction boundaries. Recovery without hard rules fails under context pressure. The pairing is what works.
Structural indexing: let the orchestrator navigate, not comprehend
When you ask an agent to work with a large specification, the naive approach is to load the whole thing into the conversation. A 130,000-token spec consumes most of the available context window, leaving almost nothing for reasoning.
We developed a pattern we call structural indexing: the main conversation loads only a lightweight index of section identifiers and file sizes. Sub-agents, dispatched to work on specific sections, read the full content directly from disk. The main conversation’s job is navigation and dispatch, not comprehension.
The result was dramatic — context consumption dropped by over 98% with no measurable quality degradation. The insight was architectural: the orchestrating conversation doesn’t need to understand the specification. It needs to know where things are and which agent should read what. Comprehension happens at the edges, in fresh sub-agent contexts with full attention on their assigned work.
This principle turned out to be universal. Every time the main conversation tried to hold large volumes of content — specifications coming in, agent outputs coming back, planning artefacts, even the skill’s own instructions — the same failure mode appeared. The solution was always the same: keep the orchestrator lightweight and let it coordinate, not consume.
Anti-rationalisation: the finding nobody expected
This was the genuinely surprising discovery. We found that LLMs don’t just forget workflow steps — they actively construct locally coherent justifications for skipping them.
An agent assigned to update a tracker file after completing a task would reason: “I know the state from the current conversation, so updating the tracker is redundant right now.” An agent told to write a plan to disk before executing would argue: “The next plan is obvious from context, so I’ll save time by executing directly.” Each justification is individually reasonable-looking. Each one silently breaks the recovery mechanism.
We call these anti-rationalisation failures, and they’re distinct from the model ignoring instructions or adversarial jailbreaking. The model convinces itself, through plausible reasoning, that a required step doesn’t apply right now.
The countermeasure is surprisingly specific: you have to name the exact excuses you want to prohibit. A general rule (“always update the tracker”) can be rationalised around. A rule that says “you must not skip this step, and specifically, these justifications are not valid: ‘I know the state from the current conversation,’‘this is the same session,’‘the next plan is obvious’” — that holds. The named excuses don’t recur.
This has broader implications for anyone designing AI workflows. General rules invite creative interpretation. Specific prohibitions close the rationalisation loop.
Context isolation as a verification advantage
Here’s a reframing we’re particularly proud of. The standard view of LLM context boundaries is that they’re a limitation — agents can’t see each other’s work, so coordination is hard. We found that for verification, isolation is a feature.
In our TDD workflow, the test-writing agent reads only the specification. It never sees the implementation. The implementation agent works from a different context entirely. When their independent interpretations of the specification disagree, that disagreement surfaces ambiguities and catches drift early — before it compounds.
This is conceptually similar to N-version programming (Avizienis, 1985), where multiple independent teams develop from the same specification. The SAGA study validated the principle for LLMs specifically, finding that LLM-generated test suites have systematic blind spots mirroring the generating model’s error patterns. If your test agent sees the implementation, it inherits the implementation’s blind spots.
The Agile Manifesto 25th Anniversary Workshop concluded that test-driven development produces dramatically better results from AI coding agents by preventing them from writing tests that verify broken behaviour. We arrived at the same conclusion independently.
Multi-skill interference: the next frontier
As the skill ecosystem around AI coding agents grows, a new problem is emerging. When multiple skill frameworks coexist in a single session — each with its own workflow assumptions and hard gates — they interfere in ways that neither would exhibit in isolation.
We have identified three distinct failure patterns. The first is workflow capture, where the most recently invoked skill overrides earlier guardrails. Next is sub-agent context isolation, meaning dispatched agents don’t inherit any skill context and default to generic behaviour. The third pattern is planning framework deadlock, with two skills both trying to manage plan execution simultaneously. Recent research has confirmed this isn’t just our experience. Li (2025) found a phase transition in skill selection accuracy as library size grows. Performance drops sharply once semantic confusability between skills reaches a threshold.
We’ve developed pattern-specific countermeasures. However, a general solution to multi-skill interference remains an open question. It is an active area of our research.
What this means if you’re building with AI agents
The METR randomised controlled trial (Becker et al., July 2025) found that experienced open-source developers completed tasks 19% slower with AI assistance — while believing they were 24% faster. This perception gap is the real danger. Teams won’t self-correct toward better specification practices because they genuinely believe things are going well.
Structural enforcement of specification discipline is necessary precisely because the humans in the loop can’t accurately assess when AI assistance is helping versus hindering.
If you take one thing from this post, let it be this: the problem isn’t the AI. The problem is the absence of structural discipline around the AI. Every principle we’ve discovered boils down to the same insight — don’t rely on the agent’s good intentions. Build the structure that makes doing the right thing the only available path.
Treat specifications as persistent artefacts on disk, not conversation context. Use hard enforcement, not soft suggestions. Keep your orchestrator lightweight and delegate comprehension to sub-agents. Name the specific excuses you want to prevent, because general rules invite creative interpretation. And exploit context isolation for verification — don’t fight the boundaries between agents, use them.
Try it yourself
The /implement skill is publicly available and works with Claude Code. We’d encourage you to try it on a real project and see how it changes the way your agent handles specifications.
The principles matter more than any specific tool. /implement is one implementation of them — freely available, and a good starting point. Your specification workflow should reflect how your team actually works; ours reflects how we work, and that’s the point.
If you want proof the methodology holds up in practice, Props is an open source inventory management system we built specification-first using exactly this process.
We’re continuing this research. Multi-skill interference, sub-agent schema compliance, and partial completion detection are all open problems we’re actively working on. If you’re tackling similar challenges, we’d love to hear from you.